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Preface 


These notes were written expressly for Mathematics 112 at Reed College, with first 
use in the spring of 2013. The title of the course is “Introduction to Analysis”. The 
prerequisite is calculus. Recently used textbooks have been Steven R. Lay’s “Analysis, 
With an Introduction to Proof” (Prentice Hall, Inc., Englewood Cliffs, NJ, 1986, 4th 
edition), and Ray Mayer’s in-house notes “Introduction to Analysis” (2006, available at 
http: //www.reed.edu/~mayer/mathi12.html/index.html). Ray Mayer’s notes strongly 
influenced the coverage in this book. 

In Math 112 at Reed College, students learn to write proofs while at the same time 
learning about binary operations, orders, fields, ordered fields, complete fields, complex 
numbers, sequences, and series. We also review limits, continuity, differentiation, and 
integration. My aim for these notes is to constitute a self-contained book that covers the 
standard topics of a course in introductory analysis, that handles complex-valued functions, 
sequences, and series, that has enough examples and exercises, that is rigorous, and is 
accessible to undergraduates. I maintain two versions of these notes, one in which the 
natural, rational and real numbers are constructed and the Least upper bound theorem 
is proved for the ordered field of real numbers, and one version in which the Least upper 
bound property is assumed for the ordered field of real numbers. You are reading the 
longer, former version. 

Chapter 1 is about how we do mathematics: basic logic, proof methods, and Pascal’s 
triangle for practicing proofs. Chapter 2 introduces foundational concepts: sets, Carte- 
sian products, relations, functions, binary operations, fields, ordered fields, Archimedean 
property for the set of real numbers. In Chapters 1 and 2 we assume knowledge of high 
school mathematics so that we do not practice abstract concepts and methods in a vac- 
uum. Chapter 3 through Section 3.8 takes a step back: we “forget” most previously learned 
mathematics, and we use the newly learned abstract tools to construct natural numbers, 
integers, rational numbers, real numbers, with all the arithmetic and order. I do not teach 
these constructions in great detail; my aim is to give a sense of them and to practice abstract 
logical thinking. The remaining sections in Chapter 3 are new material for most students: 
the field of complex numbers, and some topology. I cover the last section of Chapter 3 very 
lightly. Subsequent chapters cover standard material for introduction to analysis: limits, 
continuity, differentiation, integration, sequences, series, ending with the development of 
the power series )77° 5 a the exponential and the trigonometric functions. Since students 
have seen limits, continuity, differentiation and integration before, I go through chapters 4 
through 7 quickly. I slow down for sequences and series (the last three chapters). 

An effort is made throughout to use only what had been proved. For this reason, the 
chapters on differentiation and integration do not have the usual palette of trigonometric 
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and exponential examples of other books. The final chapter makes up for it and works out 
much trigonometry in great detail and depth. 

I acknowledge and thank the support from the Dean of Faculty of Reed College 
to fund exercise and proofreading support in the summer of 2012 for Maddie Brandt, 
Munyo Frey-Edwards, and Kelsey Houston-Edwards. I also thank the following people 
for their valuable feedback: Mark Angeles, Josie Baker, Marcus Bamberger, Anji Bodony, 
Zachary Campbell, Nick Chaiyachakorn, Safia Chettih, Laura Dallago, Andrew Erlanger, 
Joel Franklin, Darij Grinberg, Rohr Hautala, Palak Jain, Ya Jiang, Albyn Jones, Wil- 
low Kelleigh, Mason Kennedy, Christopher Keane, Michael Keppler, Ryan Kobler, Oleks 
Lushchyk, Molly Maguire, Benjamin Morrison, Samuel Olson, Kyle Ormsby, Angélica Os- 
orno, Shannon Pearson, David Perkinson, Jeremy Rachels, Ezra Schwartz, Jacob Sharkan- 
sky, Marika Swanberg, Simon Swanson, Matyas Szabo, Ruth Valsquier, Xingyi Wang, 
Emerson Webb, Livia Xu, Qiaoyu Yang, Dean Young, Eric Zhang, Jialun Zhao, and two 
anonymous reviewers. If you have further comments or corrections, please send them to 


irena@purdue. edu. 


The briefest overview, motivation, notation 


What are the meanings of the following: 


5+6 

7-9 

8—4 

4/5 

V2 

4-8 

1/3 = 0.333... 1=3-1/3 =0.999... 

(a-b)? =a? -b? 

(a+b)-(c+d) =ac+ad+ bc + bd 

(a+b)-(a—b) =a? —b? 

(a +5)? = a? + 2ab + b? 

Ja: Vb = Vab (for which a, b?) 

What is going on: /—4- /—9 = \/(—4)(—9) = V36 = 6, 
/—4-/-9 = 21-31 = -6 


You know all of the above except possibly the complex numbers in the last two rows, 


where obviously something went wrong. We will not resolve this last issue until later in 
the semester, but the point for now is that we do need to reason carefully. 

The main goal of this class is to learn to reason carefully, rigorously. Since one 
cannot reason in a vacuum, we will (but of course) be learning a lot of mathematics as 
well: sets, logic, various number systems, fields, the field of real numbers, the field of 
complex numbers, sequences, series, some calculus, and that e** = cosx + isin. 

We will make it all rigorous, i.e., we will be doing proofs. A proof is a sequence of 
steps that logically follow from previously accepted knowledge. 

But no matter what you do, never divide by 0. For further wise advice, turn to 
Appendix A. 
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[NOTATIONAL CONVENTION: 'TEXT BETWEEN SQUARE BRACKETS IN THIS FONT AND IN 
RED COLOR SHOULD BE READ AS A POSSIBLE REASONING GOING ON IN THE BACKGROUND 
IN YOUR HEAD, AND NOT AS PART OF FORMAL WRITING. | 


t1. Exercises with a dagger are invoked later in the text. 


*2. Exercises with a star are more difficult. 


Introduction to Analysis 


with Complex Numbers 


Chapter 1: How we will do mathematics 


1.1 Statements and proof methods 


Definition 1.1.1. A statement is a reasonably grammatical and unambiguous sentence 
that can be declared either true or false. 


Why do we specify “reasonably grammatical”? We do not disqualify a statement just 
because of poor grammar, nevertheless, we strive to use correct grammar and to express the 
meaning clearly. And what do we mean by true or false? For our purposes, a statement 
is false if there is at least one counterexample to it, and a statement is true if it has been 
proved so, or if we assume it to be true. 


Examples and non-examples 1.1.2. 
(i) The sum of 1 and 2 equals 3. (This is a true statement.) 
(ii) Seventeen. (This is not a statement.) 
(iii) Seventeen is the seventh prime number. (This is a true statement.) 


) 
) 
(iv) Is 2 positive? (This is not a statement.) 

(v) 1 =2.* (This is a false statement.) 

(vi) For every real number € > 0 there exists a real number 6 > 0 such that for all z, 
if 0 < |x —a| < 6 then z is in the domain of f and |f(x) — L| < «. (This isa 
statement, and it is (a part of) the definition of the limit of a (special) function f 
at a being L. Out of context, this statement is neither true or false, but we can 
prove it or assume it for various functions f.) 

(vii) Every even number greater than 4 can be written as a sum of two odd primes. 
(This statement is known as Goldbach’s conjecture. No counterexample is 
known, and no proof has been devised, so it is currently not known if it is true or 
false.) 

These examples show that not all statements have a definitive truth value. What 
makes them statements is that after possibly arbitrarily assigning them truth values, differ- 
ent consequences follow. For example, if we assume that (vi) above is true, then the graph 
of f near a is close to the graph of the constant function L. If instead we assume that 
(vi) above is false, then the graph of f near a has infinitely many values at some vertical 
distance away from L no matter how much we zoom in at a. With this in mind, even “I 
am good” is a statement: if I am good, then I get a cookie, but if I am not good, then you 


* ‘This statement can also be written in plain English as “One equals two.” In mathematics it is acceptable 
to use symbolic notation to some extent, but keep in mind that too many symbols can make a sentence hard to 
read. In general we avoid starting sentences with a symbol. In particular, do not make the following sentence. 
“=” is a verb. Instead make a sentence such as the following one. Note that “=” is a verb. 
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get the cookie. On the other hand, if “Hello” were to be true or false, I would not be able 
to make any further deductions about the world or my next action, so that “Hello” is not 
a statement, but only a sentence. 


A useful tool for manipulating statements is a truth table: it is a table in which the 
first few columns may set up a situation, and the subsequent columns record truth values of 
statements applying in those particular situations. Here are two examples of truth tables, 
where “7” of course stands for “true” and “F” for “false”: 


i | constant continuous differentiable everywhere 
f(a) =a F T T 
fC eae F aL F 
fast T T T 

x Yy | ry > 0 cy <0 ry <0 

eo >0 eae T F F 

x>O0 y <0 F T F 

x<0O y>0 F T T 

x<O y <0 F F F 


Note that in the second row of the last table, in the exceptional case y = 0, the 
statement ry < 0 is false, but in “the majority” of the cases in that row ry < 0 is true. 
The one counterexample is enough to declare xy < 0 not true, i.e., false. 


Statements can be manipulated just like numbers and variables can be manipulated, 
and rather than adding or multiplying statements, we connect them (by compounding the 


sentences in grammatical ways) with connectors such as “not”, “and”, “or”, and so on. 


Statement connecting: 

(1) Negation of a statement P is a statement whose truth values are exactly opposite 
from the truth values of P (under any specific circumstance). The negation of P 
is denoted “not P” (or ““P”). 

Some simple examples: the negation of “A = B” is “A # B”; the negation of 
“A < B” is “A> B”; the negation of “I am here” is “I am not here” or “It is not 
the case that I am here”. 

Now go back to the last truth table. Note that in the last line, the truth values 
of “xy > 0” and “xy < 0” are both false. But one should think that “xy > 0” 
and “xy < 0” are negations of each other! So what is going on, why are the 
two truth values not opposites of each other? The problem is of course that the 
circumstances x < 0 and y < 0 are not specific enough. The statement “xy > 0” 
is under these circumstances false precisely when y = 0, but then “xy < 0” is true. 
Similarly, the statement “xy < 0” is under the given circumstances false precisely 
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when y < 0, but then “xy > 0” is true. Thus, once we make the conditions specific 
enough, then the truth values of “xy > 0” and “xy < 0” are opposite, so that the 
two statements are indeed negations of each other. 

Conjunction of statements P and Q is a statement that is true precisely when 
both P and @ are true, and it is false otherwise. It is denoted “P and Q” or 
“P AQ”. We can record this in a truth table as follows: 


P Q| PandQ 
i= 6 T 
T F F 
F T F 
F F F 


Disjunction of statements P and Q is a statement that is false precisely when 
both P and Q are false, and it is true otherwise. We denote it as “P or Q” or as 
“PV Q”. In other words, as long as either P or Q is true, then Por Q is true. In 
plain language, unfortunately, we use “or” in two different ways: “You may take 
cream or sugar” says you may take cream or sugar or both, just like in the proper 
logical way, but “Tonight we will go to the movies or to the baseball game” implies 
that we will either go to the movies or to the baseball game but we will not do 
both. The latter connection of two sentences is in logic called exclusive or, often 
denoted xor. Even “either-or” does not disambiguate between “or” and “xor”. 
The truth table for the two disjunctions is: 


P @| PorQ PxorQ 


f. £ E F 
T F d§ sl 
F T a T 
F F F F 


Implication or a conditional statement is a statement of the form “P im- 


plies Q,” or variants thereof, such as all of the following: 


P implies Q. 

If P then Q. 

P is a sufficient condition for Q. 
P only if Q. 

Q if P. 

Q provided P. 

Q given P. 

Q whenever P. 

Q is a necessary condition for P. 
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“Given P, Q follows,” 

“Q@ whenever P”. 
P is called the antecedent and Q the consequent. A symbolic abbreviation is 
ae — od 0 
An implication is true when a true conclusion follows a true assumption, or when- 
ever the assumption is false. In other words, P = Q is false exactly when P is 
true and Q is false. 


Pee 1 |e Sa 
T T T 
T F F 
F T T 
F F T 


It may be counterintuitive that a false antecedent always makes the implication 
true. Bertrand Russell once lectured on this and claimed that if 1 = 2 then he 
(Bertrand Russell) was the pope. An audience member challenged him to prove 
it. So Russell reasoned somewhat like this: “If I am the pope, then the consequent 
is true. If the consequent is false, then I am not the pope. But if I am not the 
pope, then the pope and I are two different people. By assumption 1 = 2, so we 
two people are one, so I am the pope. Thus no matter what, I am the pope.” 
Furthermore, if 1 = 2, then Bertrand Russell is also not the pope. Namely, if he is 
not the pope, the consequent is true, but if he is the pope, then the pope and he 
are one, and since one equals two, then the pope and he are two people, so Russell 
cannot be the pope. 

A further discussion about why false antecedent makes the implication true is in 
the next discussion (5). 

Unfortunately, the implication statement is not used consistently in informal spo- 
ken language. For example, your grandmother may say: “You may have ice cream 
if you eat your broccoli” when she means “You may have ice cream only if you eat 
your broccoli.” Be nice to your grandmother and eat that broccoli even if she does 
not express herself precisely because you know precisely what she means. But 
in mathematics you do have to express yourself precisely! (Well, read the next 
paragraph. ) 


Even in mathematics some shortcuts in precise expressions are acceptable. Here 
is an example. The statements “An object x has property P if somethingorother 
holds” and “An object x has property P if and only if somethingorother holds” (see 
(5) below for “if and only if’) in general have different truth values and the proof 
of the second is longer. However, the definition of what it means for an object 
to have property P in terms of somethingorother is usually phrased with “if”, but 
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“if and only if? is meant. For example, the following is standard: “Definition: 
A positive integer strictly bigger than 1 is prime if whenever it can be written as 
a product of two positive integers, one of the two factors must be 1.” The given 
definition, if read logically precisely, since it said nothing about numbers such as 
4 = 2-2, would allow us to call 4 prime. However, it is an understood shortcut 
that only the numbers with the stated property are called prime. 

Equivalence or the logical biconditional of P and Q stands for the compound 
statement (P > Q)and(Q => P). It is abbreviated “P = Q” or “P iff Q”, and is 
true precisely when P and Q have the same truth values. 

For example, for real numbers x and y, the statement “x < y+ 1” is equivalent to 
“x —1<y.” Another example: “2x = 4x2” is equivalent to “x = 2x7,” but it is 
not equivalent to “1 = 2x.” (Say why!) 


We now backtrack on the truth values of P > Q. We can certainly fill in some 
parts without qualms, leaving some unknown truth values x and y: 


Po 30) || SPO? sO PoP eO 
aoe Au iy a 
ie 3h F x F 
F T De F F 
F F y y T 


Since the last column above is the conjunction of the previous two, the last line 
forces the value of y to be T. If x equals F’, then the truth values of P => Q are the 
same as the truth values of P © Q, which would say that the statements P > Q 
and P © Q are logically the same. But this cannot be: “If r > 0 then r > 0” is 
true whereas “If r > 0 then r > 0” is false. So this may convince you that the 
truth values for the third and the fifth column have to be distinct, and this is only 
possible if x is T. 


Here is the truth table for all the connectives so far: 


iP Q | not P PandQ, PorQd: £ xor' QO PSQ PSO 


T T F T T F A i A 
T F F F it T F F 
F T T F T ae ae F 
F F T F F F T T 


One can form more elaborate truth tables if we start not with two statements P 
and @ but with three or more. Examples of logically compounding P,Q, and R 
are: PandQand R, (PandQ) => Q, et cetera. For manipulating three statements, 
we would fill a total of 8 rows of truth values, for four statements there would be 
16 rows, and so on. 
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(6) Proof of P is a series of steps (in statement form) that establish beyond doubt that 


P is true under all circumstances, weather conditions, political regimes, time of 
day... The logical reasoning that goes into mathematical proofs is called deductive 
reasoning. Whereas both guessing and intuition can help you find the next step 
in your mathematical proof, only the logical parts are trusted and get written 
down. Proofs are a mathematician’s most important tool; the book contains many 
examples, and the next few pages give some examples and ideas of what proofs 
are. 


Ends of proofs are usually marked by LU, I //, or QED (for “quod erat demon- 


strandum”, which is Latin for “that which was to be proved”). It is a good idea to 
mark the completions of proofs especially when they are long or with many parts 
and steps — that helps the readers know that nothing else is to be added. 


The most trivial proofs simply invoke a definition or axiom, such as “An even 
integer is of the form 2 times an integer,” “An odd integer is of the form 1 plus 2 
times an integer,” or, “A positive integer is prime if whenever it can be written 


as a product of two positive integers, one of the two factors is 1.” 


Another type of proof consists of filling in a truth table. For example, P or ( not P) 
is always true, no matter what the truth value of P is, and this can be easily verified 
with the truth table: 

P | not P Por not P 


At F He 
F T T 


A formula using logical statements that is always true is called a tautology. 


So Por not P is a tautology. Here is another example of tautology: ((P => 
Q) and P) > Q, and it is proved below with the truth table: 


PQ|P3Q (P=>Q)andP ((P>Q)andP)>Q 


A ieee ‘iu ie iis 
T F F F 4% 
F T ae F Lr 
F F a F a8 


This particular tautology is called modus ponens, and its most famous example 
is the following: 

Every man is mortal. (If X is a man, then X is mortal.) 

Socrates is human. 

Therefore, Socrates is mortal. 
Here is a more mathematical example of modus ponens: 

Every differentiable function is continuous. 

f is differentiable. 


Section 1.1: Statements and proof methods 19 


Therefore, f is continuous. 


Another tautology is modus tollens: ((P > Q)and(notQ)) => (not P). To 
prove it, one constructs a truth table as before for modus ponens. — It is a 
common proof technique to invoke the similarity principle with previous work 
that allows one to not carry out all the steps, as I just did. However, whenever 
you invoke the proof-similarity principle, you better be convinced in your mind 
that the similar proof indeed does the job; if you have any doubts, show all work 
instead! In this case, I am sure that the truth table does the job, but if you are 
seeing this for the first time, you may want to do the actual truth table explicitly 
to get a better grasp on these concepts. 
Here is a mathematical example of modus tollens: 

Every differentiable function is continuous. 

f is not continuous. 

Therefore, f is not differentiable. 
Here is another example on more familiar ground: 

If you are in Oregon, then you are in the USA. 

You are not in the USA. 

Therefore, you are not in Oregon. 


Some proofs can be pictorial/graphical. Here we prove with this method that for 
any real numbers x and y, |z| < y if and only if —y < x < y. (We will see many 
uses of absolute values.) Proof: [FOR A BICONDITIONAL P = Q WE NEED TO 
PROVE P > Q AND Q => P.]| The assumption |x| < y implies that y must be 
positive, and the assumption —y < x < y implies that —y < y, which also says 
that y must be positive. So, with either assumption, we can draw the following 
part of the real number line: 


—y 0 y 


Now, by drawing, the real numbers x with |x| < y are precisely those real num- 
bers x with —y <a <_y. A fancier way of saying this is that |x| < y if and only if 


—ysr<y. 


Similarly, for all real numbers x and y, |x| < y if and only if -y <a <y. (Here, 
the word “similarly” is a clue that I am invoking the proof-similarity principle, 
and a reader who wants to practice proofs or is not convinced should at this point 
work through a proof by mimicking the steps in the previous one.) 


Some (or actually most) proofs invoke previous results without re-doing the previ- 
ous work. In this way we prove the triangle inequality, which asserts that for all 
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real numbers x and y, |x+y| < |z|+|y|. (By the way, we will use the triangle in- 
equality intensely, so understand it well.) Proof: Note that always —|x| < x < |a\, 
—|y| < +y < |y|. Since the sum of smaller numbers is always less than or equal to 
the sum of larger numbers, we then get that —|x| — |y| < «+y < |a|+4+ |y|. But 
—|a] —|yl = —(al + |yl), so that —(J2| + |y|) <@+y < |x| + |y|. But then by the 
previous result, ja + y| < |x| + ly]. 


Most proofs require a combination of methods. Here we prove that whenever x is a 
real number with |z—5| < 4, then |x? —32z| < 900. Proof: The following is standard 
formatting that you should adopt: first write down the left side of the desired 
inequality (|2° — 32|), then start manipulating it algebraically, in intermediate 
steps add a clever 0 here and there, multiply by a clever 1 here and there, 
rewrite, simplify, make it less than or equal to something else, and so on, every 
step should be either obvious or justified on the right, until at the end you get the 
quantity on the right (900): 
|a? — 32| < |x| + |32| (by the triangle inequality) 

= |z|° + 3]2| 

= |x —54+5|° + 3|a — 5 +5] (by adding a clever 0) 

< (|x — 5| + 5)? + 3(|2 — 5] + 5) 

(by the triangle inequality 
and since a < b implies that a? < b? ) 

< (4+5)? + 3(44+5) (since by assumption |x — 5] < 4) 

=9°+3-9 

= 9(97 +3) 

< 900. 


Here is a pictorial proof establishing the basis of trigonometry and the definition 


of slope as rise over run: namely that s = 
a 
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Proof: The areas of the big and small triangles are AB and sab, and the area of 
the difference is base times the average height, i.e., it is (A — q) Pte Thus 

b6+B 

“pr 

By multiplying through by 2 we get that AB = ab+ (A—a)(b+ B) =ab+ Ab+ 
AB—ab—aB, so that, after cancellations, Ab = aB. Then, after dividing through 
by aA we get 8 = 2, 


a 


1 1 
948 = gao+ (A —a) 


Another common method of proof of a statement P is proof by contradiction: 
you assume that not P is true and by using only correct logical and mathematical 
steps you derive some nonsense (contradiction). This then says that not P is false, 
so that not (not P) = P must be true. Beware: proofs by contradiction are in 
general not considered elegant, nevertheless, they can be very powerful. 

(Due to Pythagoras.) ./2 is not a rational number. (A number is rational if 
it is a ratio of two whole integers, where the denominator is not zero.) Proof by 
contradiction: Suppose that \/2 is rational. This means that /2 = % for some 
whole numbers a and b with b non-zero. Let d be the greatest common divisor of a 
and b. Write a = agd and b = bod for some integers ao, bp # 0. Then J2= $= Ba 
so bp\/2 = ag, and so by squaring both sides we get that 202 = 07. 80 G4 = 20718 
an even number, which by Exercise 1.1.12 means that ap is an even number. Write 
do = 2a, for some integer aj. Then 4a? = a? = 2b2, so that b2 = 2a? is even, 
whence again bo is even. But then 2 divides both ao and bo, so that 2d divides both 
a and 6, which contradicts the assumption that d was the greatest common divisor 


of a and b. Thus it is not the case that 2 is rational, so it must be irrational. 


Exercises for Section 1.1 


1.1.1. Determine and justify the truth value of the following statements. 


1 


) 
ii) 
iii) 
iv) 
v) 

) 


vi 


3 is odd or 5 is even. 

If n is even, then 3n is prime. 
If 3n is even, then n is prime. 
If n is prime, then 3n is odd. 
If 3n is prime, then n is odd. 
(PandQ) => P. 
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1.1.2. Sometimes statements are not written precisely enough. For example, “It is not the 


” 


case that 3 is prime and 5 is even” may be saying “not (3 is prime and 5 is even),” or it 
may be saying “(not (3 is prime)) and (5 is even).” The first option is true and the second 
is false. 
Similarly analyze several possible interpretations of the following ambiguous sentences: 

i) If 6 is prime then 7 is even or 5 is odd. 

ii) It is not the case that 3 is prime or if 6 is prime then 7 is even or 5 is odd. 
General advice: Write precisely; aim to not be misunderstood. 
1.1.3. Add the following columns to the truth table in (5): Pand not Q, (not P) and (not Q), 
(not P) or (notQ), P > not Q, (not P) > not Q. Are any of the new columns negations 
of the columns in the truth table (5) or of each other? 
1.1.4. Suppose that P = Q is true and Q is false. Prove that P is false. 
1.1.5. Prove that P > Q is equivalent to (not Q) => (not P). 


1.1.6. Simplify the following statements: 
ii Pond Pyorr. 

li) PSP? 

iii) (P and Q) or (PorQ). 
1.1.7. Prove with truth tables that the following statements are true. 
i) (PSQ)s [(P= Q)and(Q = P)|. 
ii) (P= Q) =i Q or not-P): 

iii) (PandQ) = [Pand(P = Q)). 

iv) |[P > (Qor R)| © [(Pand not Q) = R). 
1.1.8. Assume that Por@Q is true and that R > Q is false. Determine with proof the 
truth values of P,Q, R, or explain if there is not enough information. 


1.1.9. Assume that (PandQ) = R is false. Determine with proof the truth values of 
P,Q, R, or explain if there is not enough information. 


1.1.10. Suppose that x is any real number such that |v+2| < 3. Prove that |x3—3z| < 200. 
1.1.11. Suppose that x is any real number such that |x—1| < 5. Find with proof a positive 
constant B such that for all such x, |374 — 2z| < B. 
1.1.12. (Odd-even integers) 
i) Prove that the sum of two odd integers is an even integer. 
ii) Prove that the product of two integers is odd if and only if the two integers are 
both odd. 
iii) Suppose that the product of two integers is odd. Prove that the sum of those two 


integers is even. 
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iv) Suppose that the sum of the squares of two integers is odd. Prove that one of the 
two integers is even and the other is odd. 

v) Prove that the product of two consecutive integers is even. Prove that the product 
of three consecutive integers is an integer multiple of 6. 

vi) Prove that the sum of two consecutive integers is odd. Prove that the sum of three 
consecutive integers is an integer multiple of 3. 


1.1.13. (The quadratic formula) Let a,b,c be real numbers with a non-zero. Prove 
(with algebra) that all solutions x of the quadratic equation ax? + br + c = 0 are of the 


he —b+ Vb? — 4ac 


2a 


form 


1.1.14. Prove that /3 is not a rational number. (Remark: It is harder to prove that 7 
and e are not rational.) 


1.1.15. For which integers n is \/n not a rational number? 


1.1.16. (Cf. Exercise 10.2.3.) Draw a unit circle and a line segment from the center to the 
circle. Any real number x uniquely determines a point P on the circle at angle x radians 
from the line. Draw the line from that point that is perpendicular to the first line. The 
length of this perpendicular line is called sin(x), and the distance from the intersection of 
the two perpendicular lines to the center of the circle is called cos(x). This is our definition 
of cos and sin. 

Consider the following picture inside the circle of radius 1: 


i) Label the line segments of lengths sin(x), sin(y), cos(y). 
ii) Use ratio geometry (from page 20) to assert that the smallest vertical line in the 
bottom triangle has length sin(x) cos(y). 
iii) Use trigonometry and ratio geometry to assert that the vertical line in the top 
triangle has length sin(y) cos(z). 
iv) Prove that sin(x + y) = sin(x) cos(y) + sin(y) cos(a). 
1.1.17. Use an illustration similar to the one in the previous exercise to prove cos(x+y) = 


cos(x) cos(y) — sin(x) sin(y). 
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2 


1.1.18. Assuming that the area of the circle of radius r is mr“, convince yourself with 


proportionality argument that the area of the region below, where x is measured in radians, 


eee Sey) 
IS 547". 


+ 1.1.19. (Invoked in Theorem 10.2.5.) Let 2 be a small positive real number. Consider 
the following picture with a circular segment of radius 1 and two right triangles: 


i) Assert that the area of the small triangle is strictly smaller than the area of the 
wedge, which in turn is strictly smaller than the area of the big triangle. 


ii) Using the previous part and ratio geometry (from page 20), prove that 


$sin(x) cos(x) < da < $ tang. 
1 
ey cos x* 


iii) Using the previous part, prove that 0 < cos(xz) < 


*1.1.20. (Logic circuits) Logic circuits are simple circuits which take as inputs logical 
values of true and false (or 1 and 0) and give a single output. Logic circuits are composed 
of logic gates. Each logic gate stands for a logical connective you are familiar with— it could 
be and, or, or not (more complex logic circuits incorporate more). The shapes for logical 


and, or, not are as follows: 


D- D> 


Given inputs, each of these logic gates outputs values equal to the values in the associated 
truth table. For instance, an “and” gate only outputs “on” if both of the wires leading 
into it are “on”. From these three logic gates we can build many others. For example, the 


following circuit is equivalent to xor. 


A circuit that computes xor. The output on the right is on when 
exactly one of the inputs on the left is on. 
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Make logic circuits that complete the following tasks. (It may be helpful to make logic 
tables for each one.) 

i) xor in a different way than the circuit above. 
ii) “P implies Q” implication. 


) 

iii) xor for three inputs. 

iv) Is a 3-digit binary number greater than 2? 
) 


v) Is a 4-digit binary string a palindrome? 


1.2 Statements with quantifiers 


“The number x equals 1” is true for some x and false for some x. For determining 

a statement’s veracity we possibly need a further qualification. We can use the universal 
quantifier “for all”, “for every”, or the existential quantifier “there exists”, “for some”. 
The statement above could be modified to one of the following: 

(1) “There exists a real number x such that x = 1.” 

(2) “For all real numbers xz, x = 1,” which is logically the same as “For every real 

number x, x = 1,” and the same as “Every real number equals 1.” 

Certainly the first statement is true and the second is false. 


For shorthand we abbreviate “for all” with the symbol V, and “there exists” with 4. 
These abbreviations come in handy when we manipulate logical statements. The general 
forms of abbreviated statements with quantifiers are: 


“Ya with a certain specification, P(x) holds” = “Va P()” 


14 | 


dz with a certain specification, P(x) holds” = “dx P(x)” 


where P is some property that can be applied to objects x in question. The forms on the 
left have an explicit specifications on the scope of the x, and in the forms on the right the 
scope of the x is implicit. 


Warning: For ease of readability it may be better to write out full words rather than 
symbolic abbreviations. 

We read the displayed statements above as “for all 2, P of x [holds/is true]” and 
“there exists x such that P of x [holds/is true]”, respectively. The part “such that” only 
appears with the existential quantifier as a grammar filler but without any logical meaning; 
it can be replaced with “for which”, and can sometimes be shortened further. For example: 
“There exists a function f such that for all real numbers x, f(x) = f(—2)” can be rewritten 
with equal meaning as “There exists a function f that is defined for all real numbers and 
is even,” or even shorter as “There exists an even function.” (No “such that” appears in 
the last two versions. ) 
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Read the following symbolic statement (it defines the limit of the function f at a to 
be L; see Definition 4.1.1): 


Ve > 056 > 0Vz,0< |x-—al <6 => (|f(x) -—Ll<e. 


When are the truth values and the negations of statements with quantifiers? We first 
write a truth table with all the possible situations with regards to P in the first column, 
and other columns give the truth values of the quantifier statements: 


a possible situation Vo Pla) Ve not Play Avnot P(e). arP(z) 


there are no x of the T T F F 
specified type vacuously vacuously 


there are x of the speci- 
fied type, P true for all E F F a 


there are x of the speci- 
fied type, P false for all F T A F 


there are x of the spec- 
ified type, P true for 
some, false for some F F T T 


A “for all” statement is true precisely when without exception all x with the given 
description have the property P, and a “there exists” statement is true precisely when at 
least one x with the given description satisfies property P. One proves a “for all” statement 
by determining that each x with the given description has the property P, and one proves a 
“there exists” statement by producing one specimen x with the given description and then 
proving that that specimen has property P. If there are no x with the given specification, 
then any property holds for those no-things x vacuously. For example, any positive real 
number that is strictly smaller than —1 is also zero, equal to 15, greater than 20, product 
of distinct prime integers, and any other fine property you can think of. 

Notice that among the columns with truth values, one and three have opposite values, 
and two and four have opposite values. This proves the following: 


Theorem 1.2.1. The negation of “Vx P(x)” is “Ax not P(x)”. The negation of “dx P(x)” 
is “Va not P(x).” 


Thus “Va P(x)” is false if there is even one tiny tiniest example to the contrary. 
“Every prime number is odd” is false because 2 is an even prime number. “Every whole 
number divisible by 3 is divisible by 2” is false because 3 is divisible by 3 and is not divisible 
by 2. 
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Remark 1.2.2. The statement “For all whole numbers x between 1/3 and 2/3, x? is 
irrational” is true vacuously. Another reason why “For all whole numbers x between 1/3 
and 2/3, x? is irrational” is true is that its negation, “There exists a whole number x 
between 1/3 and 2/3 for which x? is rational”, is false because there is no whole number 
between 1/3 and 2/3: since the negation is false, we get yet more motivation to declare the 


original statement true. 


Exercises for Section 1.2 


1.2.1. Show that the statements are false by providing counterexamples. 
i) No number is its own square. 
All numbers divisible by 7 are odd. 


The square root of all real numbers is greater than 0. 


iv) For every real number 2, x* > 0. 


1.2.2. Determine the truth value of each statement. Justify your answer. 
For all real numbers a, b, (a + 6)? = a? + b?. 
For all real numbers a, b, (a + 6)? = a? + 2ab + b?. 


1 


ll 


iii) For all real numbers x < 5, x? > 16. 


iv) There exists a real number x < 5 such that x? < 25. 


v) There exists a real number x such that x2 = —4. 


vi) There exists a real number x such that x? = —8. 


) 
) 
) 
) 
) 
) 
vii) For every real number « there exists a positive integer n such that x” > 0. 
viii) For every real number «x and every integer n, |a| < x”. 
ix) For every integer m there exists an integer n such that m+ n is even. 
x) There exists an integer m such that for all integers n, m+ n is even. 
) 
) 
) 


2 _ n is even. 


xi) For every integer n, n 
xii) Every list of 5 consecutive integers has one element that is a multiple of 5. 


xiii) Every odd number is a multiple of 3. 


1.2.3. Explain why the following statements have the same truth values: 
i) There exists x such that there exists y such that P holds for the pair (x, y). 
ii) There exists y such that there exists x such that P holds for the pair (z, y). 
iii) There exists a pair (x,y) such that P holds for the pair (2, y). 
1.2.4. Explain why the following statements have the same truth values: 
i) Vx > 0 Vy > 0,2y > 0. 
ii) Vy >O0Va>0,ry > 0. 
iii) V(x, y), x,y > 0 implies zy > 0. 
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1.2.5. (Contrast with the switching of quantifiers in the previous two exercises.) Explain 
why the following two statements do not have the same truth values: 

i) For every x > 0 there exists y > 0 such that ry = 1. 

ii) There exists y > 0 such that for every x > 0, ry = 1. 


1.2.6. Rewrite the following statements using quantifiers: 


i) 7 is prime. 


ii) There are infinitely many prime numbers. 


iii) Everybody loves Raymond. 


1V 


So NES RES AL, 


Spring break is always in March. 


1.2.7. Let “xLy” represent the statement that x loves y. Rewrite the following statements 
symbolically: “Everybody loves somebody,” “Somebody loves everybody,” “Somebody is 
loved by everybody,” “Everybody is loved by somebody.” At least one statement should 


be of the form “Vx dy, xLy”. Compare its truth value with that of “da Vy, xLy”. 


1.2.8. Find a property P of real numbers z, y, z such that “Vy dx Vz, P(x,y,z)” and 


“Wz da Vy, P(x, y,z)” have different truth values. 


1.2.9. Suppose that it is true that there exists x of some kind with property P. Can we 
conclude that all x of that kind have property P? (A mathematician and a few other jokers 
are on a train and see a cow through the window. One of them generalizes: “All cows in 
this state are brown,” but the mathematician corrects: “This state has a cow whose one 
side is brown.” ) 


1.3 More proof methods 


When statements are compound, they can be harder to prove. Fortunately, proofs 
can be broken down into simpler statements. An essential chart of this breaking down is 
in the chart on the next page. 


Example 1.3.1. Integers 2 and 3 are prime, i.e., 2 is a prime integer and 3 is a prime 
integer. 


Proof. Let m and n be whole numbers strictly greater than 1. If m-n = 2, then 1 < 
m,n <2,som=n= 2, but 2-2 is not equal to 2. Thus 2 cannot be written as a product 
of two positive numbers different from 1, so 2 is a prime number. If instead m-n = 3, then 
1< m,n <3. Then all combinations of products are 2 -2,2-3,3-2,3-3, none of which 


is 3. Thus 3 is a prime number. 
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Statement 


How to prove it 


P (via contradiction). 


Suppose not P. Establish some nonsense 
(that makes not P thus impossible so that 
P must hold). 


PandQ. 


Prove P. Prove Q. 


Por Q. 


Suppose that P is false. Then prove Q. 
Alternatively: Suppose that Q is false and 
then prove P. 

(It may even be the case that P is true 
always. Then simply prove P. Or simply 
prove Q.) 


If P then Q. 


Suppose that P is true. Then prove Q. 
Contrapositively: Suppose that Q is false. 
Prove that P is false. 


Pe Q, 


Prove P > Q. Prove Q => P. 


For all x of a specified 
type, property P holds for 
ios 


Let x be arbitrary of the specified type. 
Prove that property P holds for x. 
(Possibly break up into a few subcases.) 


There exists x of a speci- 
fied type such that prop- 
erty P holds for «x. 


Find/construct an x of the specified type. 
Prove that property P holds for x. 
Alternatively, invoke a theorem guarantee- 
ing that such x exists. 


An element x of a speci- 
fied type with property P 


is unique. 


Suppose that x and 2’ are both of spec- 
ified type and satisfy property P. Prove 
thet Sa 

Alternatively, show that x is the only so- 
lution to an equation, or the only element 


on a list, or .... 


x with property P is 


unique. 


Suppose that x and y have property P. 
Prove that x = y. 


29 


Example 1.3.2. A positive prime number is either odd or it equals 2. (Often the term 


“prime” implicitly assumes positivity, but —2 can be thought of as a prime number as 
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well.) 


Proof. Let p be a positive prime number. Suppose that p is not odd. Then p must be even. 
Thus p = 2-q for some positive whole number q. Since p is a prime, it follows that q = 1, 
so that p= 2. 


Example 1.3.3. If an integer is a multiple of 2 and a multiple of 3, then it is a multiple 
of 6. (Implicit here is that the factors are integers.) 


Proof. Let n be an integer that is a multiple of 2 and of 3. Write n = 2-p and n = 3-q for 
some integers p and gq. Then 2-p=3-q is even, which forces that g must be even. Hence 


q = 2-r for some integer r, so that n = 3-q=3-2-r=6-r. Thus n is a multiple of 6. 


Example 1.3.4. For all real numbers z, x? = (—x)?. 


Proof. Let x be an arbitrary real number. Then (—x)? = (—2x)(—x) = (—1)x(-1)z = 
(-1)(-lheaxw = 1+ 27 = 27. 


Example 1.3.5. There exists a real number x such that x? — 3x2 = 2. 


Proof. Observe that 2 is a real number and that 2° — 3-2 = 2. Thus x = 2 satisfies the 


conditions. 


Example 1.3.6. There exists a real number x such that 2° — x = 1. 


Proof. Observe that f(x) = 2° — x is a continuous function. Since 1 is strictly between 
f(0) = 0 and f(2) = 6, by the Intermediate value theorem (Theorem 5.3.1 in this book) 
[INVOKING A THEOREM RATHER THAN CONSTRUCTING 2, AS OPPOSED TO IN THE PREVI- 
OUS EXAMPLE]* there exists a real number x strictly between 0 and 2 such that f(a) = 1. 


Example 1.3.7. (Mixture of methods) For every real number wz strictly between 0 and 1 


there exists a positive real number y such that 4 a 7 = oat 


Proof. [WE HAVE TO PROVE THAT FOR ALL x AS SPECIFIED SOME PROPERTY HOLDS. | 
Let x be in (0,1). [FOR THIS x WE HAVE TO FIND y ...] Set y= 1-— 2. [WAS THIS 
A LUCKY FIND? NO MATTER HOW WE GOT INSPIRED TO DETERMINE THIS y, WE NOW 
VERIFY THAT THE STATED PROPERTIES HOLD FOR x AND y.] Since a is strictly smaller 
than 1, it follows that y is positive. Thus also xy is positive, and y+ «= 1. After dividing 

1) 2 ye 1 


the last equation by the positive number xy we get that 7 T= ogg ae 


Furthermore, y in the example above has no choice but to be 1 — x. 


* Recall that this font in brackets and in red color indicates the reasoning that should go on in the 
background in your head; these statements are not part of a proof. 
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Example 1.3.8. (The Fundamental theorem of arithmetic) Any positive integer 
n > 1 can be written as p{'p5”---p;* for some positive prime integers py < --- < px and 
some positive integers a1,...,a@,%. (Another standard part of the Fundamental theorem of 
arithmetic is that the p; and the a; are unique, but we do not prove that. Once you are 


comfortable with proofs you can prove that part yourself.) 


Proof. (This proof is harder; it is fine to skip it.) Suppose for contradiction that the 
conclusion fails for some positive integer n. Then on the list 2,3,4,...,72 let m be the 
smallest integer for which the conclusion fails. If m is a prime, take k = 1 and py = m, 
a, = 1, and so the conclusion does not fail. Thus m cannot be a prime number, and so m = 
m Mz for some positive integers m1, Mz strictly bigger than 1. Necessarily 2 << m ,,mz2 < m. 
By the choice of m, the conclusion is true for m; and mz. Write m1 = p{'ps?--- pz" and 


M2 = Gg? ee qh" for some positive prime integers py <--: < Dr, qi < +++ < dp and some 
positive integers a1,...,Q@%,61,...,6;. Thus m = pat pa? . «pak qPt gb? ... gt is a product 


of positive prime numbers, and after sorting and merging the p; and q;, the conclusion 
follows also for m. But we assumed that the conclusion fails for m, which yields the 


desired contradiction. Hence the conclusion does not fail for any positive integer. 


Example 1.3.9. Any positive rational number can be written as —, where a and b are 
positive whole numbers and where in any prime factorizations of a and 6 as in the previous 


example, the prime factors for a are distinct from the prime factors for b. 


Proof. [WE HAVE TO PROVE THAT FOR ALL ...| Let x be a(n arbitrary) positive rational 
number. [WE REWRITE THE MEANING OF THIS IN A MORE CONCRETE AND USABLE 
FORM NEXT.| Thus x = ¢ for some whole numbers a, b. If a is negative, since x is positive 
necessarily b has to be negative. But then —a, —b are positive numbers, and x = =. Thus 
by possibly replacing a, b with —a, —b we may assume that a,b are positive. [A REWRITING 
TRICK.] There may be many different pairs of a,b, and we choose a pair for which a is the 
smallest of all possibilities. [A CHOOSING TRICK. BUT DOES THE SMALLEST a EXIST?] 
Such a does exist because in a collection of given positive integers there is always a smallest 
one. Suppose that a and b have a (positive) prime factor p in common. Write a = agp and 
b = bop for some positive whole numbers ao, bo. Then x = Be and since 0 < ag < a, this 
contradicts the choice of the pair a,b. Thus a and b could not have had a prime factor in 


common. 


Exercises for Section 1.3 


1.3.1. Prove that every whole number is either odd or even. 
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1.3.2. 
1.3.3. 
1.3.4. 
1.3.5. 
1.3.6. 
1.3.7. 
1.3.8. 
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Prove that the successor of any odd integer is even. 
Prove that if n is an even integer, then either n is a multiple of 4 or n/2 is odd. 
Prove that if 0 <2 <1, then 22 <2 < 4/2. 


Prove that if 1 <a, then “2 < 2 < 2”. 


1 


Prove that there exists a real number x such that ©? + 22 = |, 


Find at least three functions f such that for all real numbers x, f(x?) = 2”. 


Prove that f(x) =z is the unique function that is defined for all real numbers and 


that has the property that for all x, f(x?) = 2°. 


1.3.9. 


Prove that there exists a real number y such that for all real numbers x, ry = y. 


1.3.10. Prove that there exists a real number y such that for all real numbers x, xy = x. 


1.3.11. Prove that for every real number «x there exists a real number y such that x+y = 0. 


1.3.12. Prove that there exists no real number y such that for all real numbers x, x+y = 0. 


1.3.13. Prove that for every real number y there exists a real number x such that x+y 4 0. 


1.4 Logical negation 
In order to be able to prove statements effectively, we often have to suppose the 

negation of a part, say for proving statements with “or” and for proofs by contradiction. 
Work and think through the following negations: 

Statement Negation 

not P i 

P andQ (not P) or (not Q) 

Por gd (not P) and (not Q) 

Pa) P and (not Q) 

Peo P = (not Q) = (not P) SQ 

For all x of the specified type, There exists x of the specified 

property P holds for x. type such that P is false for «x. 

There exists x of the specified type | For all x of the specified type, 

such that property P holds for 2. P is false for x. 


Warning: The negation of a conditional statement is not another conditional statement! 


Practice this one! 
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Example 1.4.1. There are infinitely many (positive) prime numbers. Proof by contradic- 
tion: (Due to Euclid.) Suppose that there are only finitely many prime numbers. Then 
we can enumerate them all: p1,po,...,p~n. Let a@ = (pipa:::pn) +1. Since we know 
that 2,3,5 are primes, necessarily n > 3 and soa > 1. By the Fundamental theorem of 
arithmetic (Example 1.3.8), a has a prime factor p. Since pi, p2,...,Pn are all the primes, 
necessarily p = p; for some 7. But then p = p; divides a and p,p2--- pn, whence it divides 
1 = a— (pip2-::Pn), which is a contradiction. So it is not the case that there are only 


finitely many prime numbers, so there must be infinitely many. 


Another proof by contradiction: (Due to R. MeStrovi¢é, American Mathematical 
Monthly 124 (2017), page 562.) Suppose that there are only finitely many prime numbers. 
Then we can enumerate them all: p, = 2, p2 = 3,...,pn. The positive integer p2p3 --+ Pn —2 
has no odd prime factors, and since it is odd, it must be equal to 1. Hence pop3--- pp, = 3, 


which is false since p2 = 3, p3 = 5, and so on. 


Exercises for Section 1.4 


1.4.1. Prove that the following pairs of statements are negations of each other: 
i) P SQ. 

(P and Q) or (not Pand not Q). 

ii) Pand(P > Q). 
not Por not Q. 

iii) (P > notQ)and(R= Q). 
(P and Q) or (Rand not Q). 

iv) Pand (Q or not R). 
P = (not Qand R). 


1.4.2. Why are “f is continuous at all points” and “f is not continuous at 3” not negations 
of each other? 


1.4.3. Why are “Some continuous functions are differentiable” and “All differentiable func- 
tions are continuous” not negations of each other? 


1.4.4. Why is “P => not Q” not the negation of “P > Q”? 


1.4.5. Negate the following statements: 
i) The function f is continuous at 5. 
ii) Ife > y then z > z. 
iii) For every € > 0, there exists 6 > 0 such that f(d) =. 
iv) For every € > 0, there exists 6 > 0 such that for all x, f(a -6) = ex. 
v) For every € > 0, there exists 6 > 0 such that for all x, 0 < |x — a| < 6 implies 


|f(z) —L| <e. 
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1.4.6. Prove that the following are false: 
i) For every real number x there exists a real number y such that ry = 1. 


ii) 5” + 2 is prime for all non-negative integers n. 


) 
) 
iii) For every real number 2, if x? > 4 then x > 2. 
iv) For every real number 2, Vx? = z. 
v) The cube of every real number is positive. 

) 


vi) The square of every real number is positive. 


1.5 Summation 


There are many reasons for not writing out the sum of the first hundred numbers in 
full length: it would be too long, it would not be any clearer, and we might start doubting 
the intelligence of the writer. Instead we express such sums with the summation sign &: 


100 100 


a k or iS n. 
k=1 n=1 


The counters k and n above are dummy variables, they vary from 1 to 100, as indicated 
below and above the summation sign. We could use any other name in place of k or n. 
In general, if f is a function defined at m,m-+1,m + 2,...,n, we use the summation 
shortening as follows: 


d= F(R) = f(r) + fm +1) + fm +2) +--+ f(r). 
k=m 


This is one example where good notation saves effort and often clarifies the concept. For 
typographical reasons, to prevent lines jamming into each other, we also write this as 
rn F(R). 

Now is a good time to discuss polynomials. A polynomial function is a function 
of the form f(x) = agp + a,4 +--+: + a,x" for some non-negative integer n and some 
numbers ao, @1,...,@n. We call ag + ayx +--+++ay2” a polynomial, and if a, is non-zero, 
we say that the polynomial has degree n. (More on the degrees of polynomial functions 
is in Example 1.6.5, Exercise 2.6.15.) It is convenient to write this polynomial with the 
shorthand notation 


f(@) =a9 +aye+---+an2" = aun 
k=0 


Here, of course, x° stands for 1. When we evaluate f at 0, we get ag = an +a,-0+:--+a,,-0" 
= rp) ax0", and we deduce that notationally 0° stands for 1 here. 
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Remark 1.5.1. 0° could possibly be thought of also as lim 0”, which is surely equal to 0. 
x—0 


But then one can wonder whether 0° equals 0 or 1 or to something else entirely? Well, it 

turns out that 0° is not equal to that zero limit — you surely know of other functions f for 

which lim f(a) exists but the limit is not equal to f(c). (Check out also Exercise 7.6.9.) 
wc 


Examples 1.5.2. 
5 


(1) $)2=24+2424242=10. 
k=1 


5 

(2) S0k=142434+445=15. 
k=1 
4 

(3) Sok? = 1742? +37 44? = 30. 


k=1 
12 


(4) MS cos(k7) = cos(107) + cos(117) + cos(127) =1-1+1=1. 


k=10 
2 2 
(5) So (4k*) =-440444+4-8=32 =4(-14+04141-8)=4 S> B®. 
k=-1 k=-1 


n 


(6) S° 3 = 3 added to itself n times = 3n. 


k=1 
b 


(7) ye = 2 added to itself b-—a+1 times = 2(b—a+1). 
k=a 


We can even deal with empty sums such as oe dz: here the index starts at k = 1 
and keeps increasing and we stop at k = 0, but there are no such indices k. What could 
possibly be the meaning of such an empty sum? Note that 


4 2 4 1 4 0 4 
So ax = So ax + Scan = » Ge + So ax — ar + Soap, 
k=1 k=1 k=3 k=1 k=2 k=1 k=1 
or explicitly written out: 


ay + a2 + a3 + a4 = (a4 + az) + (a3 + a4) 
= (a1) + (a2 + ag + a4) 
= ()+ (a1 + a2 +.a3 +. a4), 


from which we deduce that this empty sum must be 0. Similarly, every empty sum equals 0. 


Similarly we can shorten products with the product sign IT: 


I] fe) = fm) - fm +1) - f(m +2) +--+ f(n). 
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In particular, for all non-negative integers n, the product [],;_,& is used often and is 
abbreviated as n! = [[j-_, k. See Exercise 1.5.6 for the fact that 0! = 1. 


Exercises for Section 1.5 

1.5.1. Compute pes: +1), Se +2). 

1.5.2. Determine all non-negative integers n for which }7)) k = o¢_g n- 
1.5.3. Prove that 


ibe > fk) = So cf(k). 
k=m 


k=m 
n n 


ii) S° f(k) + So afk) = SS (F(R) + g(®)). 
k=m k=m 


1.5.4. Prove that \)_,k = ote 


m—1 n n 
1.5.5. Prove that for all integers m < n, S> f(k) + Ss f(k) = ys f(k). 
k=1 k=m k=1 


1.5.6. Prove that the empty product equals 1. In particular, we can declare that 0! = 1. 
This turns out to be very helpful notationally. 


1.5.7. Prove: 


i) [] 2=32. 
k=1 
5 
ii) |] # = 120. 


k=1 
1.6 Proofs by (mathematical) induction 


So far we have learned a few proof methods. There is another type of proofs that 
deserves special mention, and this is proof by (mathematical) induction, sometimes 
referred to as the principle of mathematical induction. This method can be used when 
one wants to prove that a property P holds for all integers n greater than or equal to an 
integer no. Typically, no is either 0 or 1, but it can be any integer, even a negative one. 

Induction is a two-step procedure: 

(1) Base case: Prove that P holds for no. 
(2) Inductive step: Let n > no. Assume that P holds for all integers no, ng +1, 29 + 
2,...,n—1. Prove that P holds for n. 

Why does induction succeed in proving that P holds for all n > ng? By the base 

case we know that P holds for no. The inductive step then proves that P also holds for 
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no +1. So then we know that the property holds for no and np + 1, whence the inductive 
step implies that it also holds for no + 2. So then the property holds for no, no + 1 and 
no +2, whence the inductive step implies that it also holds for n9 +3. This establishes that 
the property holds for ng, no +1, No +2, and np +3, so that by inductive step it also holds 
for no + 4. We keep going. For any integer n > no, in n — ng step we similarly establish 
that the inductive step holds for no, no + 1, no + 2,...,No + (Nn — no) = n. Thus for any 
integer n > no, we eventually prove that P holds for it. 
The same method can be phrased with a slightly different two-step process, with the 
same result, and the same name: 
(1) Base case: Prove that P holds for no. 
(2) Inductive step: Let n > no. Assume that P holds for integer n — 1. Prove that 
P holds for n. 
Similar reasoning as in the previous case also shows that this induction principle 
succeeds in proving that P holds for all n > no. 


Example 1.6.1. Prove the equality \°/_, k = mint) for alln > 1. 


Proof. Base case n = 1: The left side of the equation is y 3 k; which equals 1. The right 
side is 1+) which also equals 1. This verifies the base case. 

Inductive step: Let n > 1 and we assume that the equality holds for n — 1. [WE 
WANT TO PROVE THE EQUALITY FOR n. WE START WITH THE EXPRESSION ON THE 
LEFT (MESSIER) SIDE OF THE DESIRED AND NOT-YET-PROVED EQUATION FOR n AND 


MANIPULATE THE EXPRESSION UNTIL IT RESEMBLES THE DESIRED RIGHT SIDE. | Then 
n n—-1 
Ske 62 ‘55 
k=1 k=1 


_ (n—1)(n-—1+4+1) 


+n _ (by induction assumption for n — 1) 


2n 
ee + 5: (by algebra) 


as was to be proved. 


We can even prove the equality )°;_, k = mint) for all n > 0. Since we have already 


proved this equality for all n > 1, it remains to prove it for n = 0. The left side pean k; is 


0(0+1) 
2 


an empty sum and hence 0, and the right side is , which is also 0. 
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Example 1.6.2. Prove the equality )77_, k(k + 1)(k + 2)(k +3) = n(n+1)(n4 2Y(n +3)(n+4) 
for alln > 1. 


Proof. Base case n = 1: )>,_, K(k +1)(kK+2)(K+3) = 1114+1)(14+2)(14+3) =1-2-3-4= 
1-2-3-4-5 __ 1(1+1)(142)(1+3)(1+4) 
5 5 


, which verifies the base case. 

Inductive step: Let n > 1 and we assume that the equality holds for n — 1. [WE 
WANT TO PROVE THE EQUALITY FOR n. WE START WITH THE EXPRESSION ON THE 
LEFT SIDE OF THE DESIRED AND NOT-YET-PROVED EQUATION FOR 7 (THE MESSIER OF 
THE TWO) AND MANIPULATE THE EXPRESSION UNTIL IT RESEMBLES THE DESIRED RIGHT 
SIDE.| Then 


S Teh + 1)(k+2)(k+3) = 
k=1 


© k(k + 1)(k + 2)(k +9) +n(n+1)(n + 2)(n +3) 
k=1 
n-I(n—-1+)(n—1+2)(n—-1+43)(n—-14+4) 
5 
+ n(n +1)(n + 2)(n +3) 


(by induction assumption) 
(n —1)n(n +1)(n+ 2)(n + 3) i 5n(n + 1)(n+ 2)(n+3) 


5 5 
= mut Jin a) tha) (n—1+5) (by factoring) 
n(n + 1)(n +2)(n + 3)(n + 4) 


5 d 


as was to be proved. 


Example 1.6.3. Assuming that the derivative of x is 1 and the product rule for deriva- 


n-1 


tives, prove that for alln > 1, 4(2”") = naz (We introduce derivatives formally in 


Section 6.1.) 


Proof. We start the induction at n = 1. By calculus we know that the derivative of x! is 
1=1-2° =1-2'~!, so equality holds in this case. 
Inductive step: Suppose that equality holds for 1,2,...,2—1. Then 


d n\ __ d n-1 

Gola me 
_ d n-1 d n-1 
= Wg) x + ae (x ) 


(by the product rule of differentiation) 


=1-2%14(n-1)2-2"? 
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(by induction assumption for 1 and n — 1) 


=z" "+(n—-1)a"" 


n-1 


= NX 


The following result will be needed many times, so remember it well. 
Example 1.6.4. For any number z and any integer n > 1, 
(=-a)((+e+¢? +e% +. +2") S107", 
Proof. When n = 1, 
(=e) (+o +2? He? bee pg") = a) ee) Sl — 2? S12 
which proves the base case. Now suppose that equality holds for some integer n — 1 > 1. 
Then 
(l-a)(lta+a?+---+2" 142") 

=(l—a2)((l+v4+2?4+---+2""1) +2") 
=(=e)Odtata? feta *)+ (l—2)2” 
=1—2"+2"—2"*! (by induction assumption and algebra) 


a n+1 
Shay 


which proves the inductive step. 


Example 1.6.5. (Euclidean algorithm) Let f(z) = a9 + a1” +---+ a,x” for some 
numbers ag, @1,.--,@, and with a, 4 0, and let g(x) = bb) + bia 4+---+b,2™ for some 
numbers bo, b1,...,bm and with b,, 4 0. Suppose that m,n > 1. Then there exist polyno- 
mials g(a) and r(x) such that f(#) = q(x) - g(x) + r(x) and such that the degree of r(x) is 
strictly smaller than m. 


Proof. We keep g(x) fixed and we prove by induction on the degree n of f(x) that the claim 
holds for all polynomials f(x). If nm < m, then we are done with q(x) = 0 and r(x) = f(z). 
Ifn =m, then we set q(x) = § and (necessarily) 


am 


r(2) = f(e) - F™9(e) 


=a + oie tana” = 5 (bo + bre +--+ Bm) 
= (ao - 5b) + (a1 — Fe bi)e ++++4+ (@m—1 —- Feb —1)a", 


which has degree strictly smaller than m. These are the base cases. 
Now suppose that n > m. Set h(x) = a, + ao” +0327 ++--+a,2"—1. By induction 
on n, there exist polynomials q,(x) and r;(x) such that h(a) = qi(x)-g(a)+1r1(x) and such 
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that the degree of r1(x) is strictly smaller than m. Then xh(x) = xqi(x) - g(a) + wri (2). 
Since the degree of xr1(x) is at most m, by the second base case there exist polynomials 
g2(x) and r2(x) such that xri(x) = go(x)g(x) + r2(x) and such that the degree of r2(x) is 
strictly smaller than m. Now set q(x) = rqi(x) + qo(x) and r(x) = r2(x) + ap. Then the 
degree of r(x) is strictly smaller than m, and 


q(x) g(a) + r(x) = qi (x) g(x) + g(x) g(x) + r2(x) + ao 
= 1g (2) g(x) + xri(x) + ao 
= rh(x) + ao 
= f(a). 


Remark 1.6.6. A common usage of the Euclidean algorithm is in finding the greatest 
common divisor of two polynomials. A polynomial d(a) divides f(x) and g(x) exactly 
when it divides g(x) and r(x) = f(x)—q(x)-g(). It is easier to find factors of polynomials 
of smaller degree. As an example, let f(x) = 24+42°+6x7+42+4+1 and g(x) = 23+227+2. 
The first step of the Euclidean algorithm gives 


r(a) = f(z) = (@+2)9(@) S27 +22 +1. 


So to find the greatest common divisor of f(a) and g(x) it suffices to find the greatest 
common divisor of g(a) and r(#). The Euclidean algorithm on these two gives ri(x) = 
g(x) — xr(x) = 0, so that to find the greatest common divisor of f(x) and g(x) it suffices 
to find the greatest common divisor of r(a) and 0. But the latter is clearly r(x). In fact, 
f(x) = (x +1)4 and g(x) = 2(x 4+ 1)?. 


Example 1.6.7. For all positive integers n, ~/n < 2. 


Proof. Base case: n = 1, so Yn = 1 < 2. 
Inductive step: Suppose that n is an integer with n > 2 and that "Wn-1 < 2. 
This means that n—1 < 2"-1. Hence n < 27-141 < 27-142"-1=2.2"-1 = 2". so that 


iD: 


Remark 1.6.8. There are two other equivalent formulations of mathematical induction 
for proving a property P for all integers n > no: 
Mathematical induction, version ITI: 
(1) Base case: Prove that P holds for no. 
(2) Inductive step: Let n > no. Assume that P holds for all integers no,no + 
1,no + 2,...,n. Prove that P holds for n+ 1. 


Mathematical induction, version IV: 
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(1) Base case: Prove that P holds for no. 
(2) Inductive step: Let n > no. Assume that P holds for integer n. Prove that P 
holds for n + 1. 
Convince yourself that these two versions of the workings of mathematical induction 
differ from the original two versions only in notation. 


Exercises for Section 1.6: Prove the following properties for n > 1 by induction. 


et. ie —— 


1.6.2. y= (* wee 


1.6.3. ae sum of the first n odd positive integers is n°. 


n 


1.6.4. S°(2k-1) =n? 


k=1 


1.6.5. (Triangle inequality) For all positive integers n and for all real numbers aj,..., Qn, 
Jay tag +---+ay| < laz|+|a2|+---+ |a,|. (Hint: there may be more than one base case. 
Why is that?) 


1.6.6. 57(3k? —k)=n?(n +1). 


1.6.7. 1-24+2-34+3-44+---4+n(n+]1)= sn(n + 1)(n +2). 
1.6.8. 7” +2 is a multiple of 3. 
1.6.9. 3°"! < (n+). 


1 il 1 
1.6.10. + 4+ 4 +--+ SP Vn. 
vl v2 v3 ae 
1.6.11 : + : ae : 5 ee : <2 : 
ee de De, Oe ae 
1.6.12. Let a; = 2, and for n . 2, An = 3a,_1. Formulate and prove a theorem giving a,, 


in terms of n (no dependence on other oa 

1.6.13. 8 divides 5" + 2-3"7! +1. 

1.6.14. 1(1!) + 2(2!) + 3(3!) +--+ + n(n!) =(n+1)!-1. 
1-621552"" oy), 


= 1 1 
1.6.16. i a 
We yee 
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~ 1 n+1 
1.6.17. 1 a , 
eat. I (1- ga) = "ap 


1.6.18. 5° 2*(k+ 1) =2"*'n+1. 
k=0 


16.19. 5,2 Se. 
+1.6.20. (Invoked in Example 9.1.9.) oe zz aS ae 
+ 1.6.21. (Invoked in the proof of Theorem 9.4.1.) For all numbers x,y, x” — y” = (a — 
y) (a™—t $a Py + oP 3y? +--+ y") Le, a — y” = (xy) ce gai kyk 
+ 1.6.22. (Invoked in the Ratio tests Theorems 8.6.6 and 9.2.3.) Let r be a positive real 
number. 

i) Suppose that for all positive integers n > no, Qn41 < Tn. Prove that for all 

positive integers n > No, Qn <7" "Ano. 


ii) Suppose that a,41; ra, where His one of <, >, >. Prove that a, Or"~a,,,. 
1.6.23. Let A, = 17 +274 374---+(2n—1)? and B, = 17+37+5?+---+(2n—-1)?. 
Discover formulas for A, and B,, and prove them (by using algebra and previous problems, 


and possibly not with induction). 


1.6.24. (From the American Mathematical Monthly 123 (2016), page 87, by K. Gaitanas) 


Prove that for every n > 2, 3"=} (ety =1-4. 


1.6.25. Let A, = rs + 55 + a ferret Ect Discover a formula for A, and prove it. 


1.6.26. How many handshakes happen at a gathering of n people if everybody shakes 


everybody else’s hands exactly once. 
1.6.27. Find with proof an integer no such that n? < 2” for all integers n > no. 
1.6.28. Find with proof an integer no such that 2” < n! for all integers n > no. 


1.6.29. For any positive integer n and real number x define S,, = 1+a+a?2+---+2". 
i) Prove that for any n > 2, xS,-1 +1= Sp, and that S,(1—2) =1-—27"}, 
ii) Prove that if s =1, then S, =n +1. 
l—gnrt! 


iii) Prove that if #1, then S, = -G=—. Compare with the proof by induction in 
Example 1.6.4. 


1.6.30. (Fibonacci numbers) Let s; = 1, sz = 1, and for all n > 2, let $,41 = 8, 4+5n-1. 
This sequence starts with 1, 1,2,3,5,8, 13, 21, 34,.... (Many parts below are taken from the 
book Fibonacci Numbers by N. N. Vorob’ev, published by Blaisdell Publishing Company, 
1961, translated from the Russian by Halina Moss; there is a new edition of the book with 
author’s last name written as Vorobiev, published by Springer Basel AG, 2002, translated 
from the Russian by Mircea Martin.) 


i) Fibonacci numbers are sometimes “motivated” as follows. You get the rare gift of 
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a pair of newborn Fibonacci rabbits. Fibonacci rabbits are the type of rabbits who 
never die and each month starting in their second month produce another pair of 
rabbits. At the beginning of months one and two you have exactly that 1 pair of 
rabbits. In the second month, that pair gives you another pair of rabbits, so at 
the beginning of the third month you have 2 pairs of rabbits. In the third month, 
the original pair produces another pair of rabbits, so that at the beginning of the 
fourth month, you have 3 pairs of rabbits. Justify why the number of rabbits at 
the beginning of the nth month is s,,. 

ii) Prove that for alln > 1, s, = a (434)" ’ (4)". (It may seem amazing 


V5 
that these expressions with square roots of 5 always yield positive integers.) Note 


that the base case requires proving this for n = 1 and n = 2, and that the inductive 
step uses knowing the property for the previous two integers. 


iii) Prove that s, +83 +85 +---+San-1 = San- 


iv) Prove that sg + s4+ 86 +--:+ San = Sen4i — 1. 
Prove that 51 + 82 + 83 +:+--+8n = 8n42—1. 


vi) Prove that s; — s9 + 3 — 84 +---+ San_1 — San = 1— S2n_1. 


Vv 
vii) Prove that 81 —Sog+ S83 — S4 +++ + Son 1 — San + S2nt1 = §2n +1. 
Prove that 81 — 82 + 83 — 84 +++-+(—1)"t1s, = (-1)""18,_1 +1. 
Prove that for all n > 3, sp) > (leis ne 


viil 


ix 
x) Prove that for all n > 1, 5? + se fees f S = SnSnii> 
xi) Prove that 8n418n—1 — 82 = (-1)”. 


xii) Prove that s1s2 + 8283 +--+ + San—182n = Sn. 

Prove that 5182 + 8283 +-+++ S2nS2n41 = S3n41 — Ll. 

Prove that ns, + (n — 1)s9 + (n — 2)83 +--+ + 28,1 + Sn = Sn44 —(n4+3). 
Prove that for all n > 1 and all k > 2, sniz = SpSn41 + SK-18n- 


xii 


X1V 


XV 


xvi) Prove that for all n,k > 1, spn is a multiple of s,. (Use the previous part.) 


Prove that sen41 = 24 + S. 
2 2 


n- 


XVil 
xviil) Prove that so, = s 


xX1Xx 


Prove that s3n = s3,, + s2 —s3 


Prove that sn41 = (2434) Sn + ( 4)". 
xxi) Prove that s3 + 86 + s9 +---+ 53n = ees (Use the previous part.) 
Prove that s? + s3+s$+---+s3 = Se ee 
(1+ v5)" 
n/5 2° 


(xxiv)* If you know a bit of number theory, prove that for all positive integers m,n, the 


XX 


NS wa rw WoW ow owe oe oe Wa oe oe oe oe a eae wae aes Ss 


XXll 


xxiii) Prove that |s, 


greatest common divisor of $m and 8p iS Sged(m,n): 
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(xxv)* Prove that s, is even if and only if n is a multiple of 3. 
Prove that s,, is divisible by 3 if and only if n is a multiple of 4. Prove that s,, is 
divisible by 4 if and only if n is a multiple of 6. Prove that s,, is divisible by 5 if 
and only if n is a multiple of 5. Prove that s,, is divisible by 7 if and only ifn is a 
multiple of 8. Prove that there are no Fibonacci numbers that have the remainder 
of 4 when divided by 8. 
Prove that there are no odd Fibonacci numbers that are divisible by 17. 


Sn+1 Sn | 


xxvi) If you know matrices, prove that for all n > 2, E : = : 
n n-1 


1 0 


1.6.31. (Via the grapevine, based on ideas of Art Benjamin, Harvey Mudd College, and 
Dan Velleman, Amherst College) A tromino is a plane figure composed of three squares in 


7 


i) Prove that for every positive integer n, any 2” x 2” square grid with exactly one 


L-shape: 


of the squares removed can be tiled with trominoes. 


ii) Prove that for every positive integer n, 4” — 1 is an integer multiple of 3. 


1.6.32. Pick a vertex V in a triangle. Draw n distinct lines from V to the opposite edge 
of the triangle. If n = 1, you get the original triangle and two smaller triangles, for a total 
of three triangles. Determine the number of distinct triangles obtained in this way with 


arbitrary n. 


1.6.33. (Spiral of Theodorus) Draw a triangle with vertices at (0,0), (1,0), (1,1). The 
hypotenuse has length V2. 
i) One of the vertices of the hypotenuse is at (0,0). At the other vertex of the 
hypotenuse, draw an edge of length 1 at the right angle away from the first triangle. 
Make a triangle from the old hypotenuse and this new edge. What is the length of 
the hypotenuse of the new triangle? 
ii) Repeat the previous step twice. 


iii) Prove that one can draw \/n for every positive integer n. 


1.6.34. (Tower of Hanoi) There are 3 pegs on a board. On one peg, there are n disks, 
stacked from largest to smallest. The task is to move all of the disks from one peg to a 
different peg, given the following constraints: you may only move one disk at a time, and 
you may only place a smaller peg on a larger one (never a larger one on a smaller one). 
Let S;, be the least number of moves to complete the task for n disks. 
i) If nm = 1, then what is the least number of moves it takes to complete the task? 
What if there are 2 disks? Repeat for 3, 4, 5 disks. 
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ii) Make a recursive formula (defining S, based on S,,_;) for this S,,. Then, make a 

guess for a non-recursive formula for S,, (defining S,, based on n without invoking 

S,—1). Prove your guess using induction and the recursive formula that you wrote. 

1.6.35. What is wrong with the following “proof by induction”? I will prove that 5” + 1 

is a multiple of 4. Assume that this is true for n — 1. Then we can write 5°-!'+1 = 4m 
for some integer m. Multiply this equation through by 5 to get that 


5" +5 = 20m, 


whence 5” + 1 = 4(5m— 1). As 5m — 1 is an integer, this proves that 5” + 1 is a multiple 
of 4. 


1.6.36. What is wrong with the following “proof by induction” besides the fact that the 
conclusion is false for many n? I will prove that all horses are of the same color. This is the 
same as saying that for any integer n > 1 and any set of n horses, all the horses belonging 
to the set have the same color. If n = 1, of course this only horse is the same color as 
itself, so the base case is proved. Now let n > 1. If we remove one horse from this set, the 
remaining n — 1 horses in the set are all of the same color by the induction assumption. 
Now bring that one horse back into the set and remove another horse. Then again all of 
these horses are of the same color, so the horse that was removed first is the same color as 
all the rest of them. 


1.7 Pascal’s triangle 


Pascal’s triangle is very useful, so read this section with the exercises. 
The following is rows 0 through 8 of Pascal’s triangle, and the pattern is obvious 


for continuation into further rows: 


rowQ0: . . . . .) : ‘ ; 1 

TOW EY 4. a As cA ok : : 1 1 

TOW 2S “fe tae BS ee Mw : 1 2 1 

rows: . . . . . . I 3 3 1 

TOW AS ac ce awk we 4 6 4 1 

rowd: . . . . 1 5 10 10 i) 1 
row6: . . . 1 6 15 20 15 6 1 
rowy7: . . 1 7 21 30 35 21 7 1 


Note that the leftmost and rightmost numbers in each row are all 1, and each of the other 
numbers is the sum of the two numbers nearest to it in the row above it. We number the 
slanted columns from left to right starting from 0: the Oth slanted column consists of all 1s, 
the 1st slanted column consists of consecutive numbers 1, 2,3,4,..., the 2nd slanted column 
consists of consecutive numbers 1,3,6,10,..., and so on for the subsequent columns. 
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Let the entry in the nth row and kth column be denoted @) We read this as “n 
choose k”. These are loaded words, however, and we will eventually justify them. 
Pascal’s triangle is defined so that for all n > 1 and allk =0,1,...,n—1, 


What would it take to compute ( 5 
0 through 100 of Pascal’s triangle, or actually a little less, only slanted columns 0 through 


)? It seems like we would need to write down rows 


5 of these 101 rows. That is too much drudgery! We will instead be smart mathematicians 


and we will prove many properties of Pascal’s triangle in general, including shortcuts for 
100 
5 


by mathematical induction. 


computing ( Me We will accomplish this through exercises, most of which can be proved 


Exercises for Section 1.7 
1.7.1. Prove that the sum of the entries in the nth row is 2”. 


1.7.2. Let & be an arbitrary non-negative integer. (This means that k is given to you not 


as a specific number but as an unknown integer.) Prove by induction on n that the sum 
n+1 


of the entries in the slanted column & in rows j < n is ( & Hale In other words, prove that 
Gy a Ca) + ae) +..-4+() = Gas which is the same as proving that wee (7) = Cay 
1.7.3. Prove that every integer n > 0 has the property that for all k = 0,1,2,...,n, 
(e) = GET 

1.7.4. Prove that (7) = ul eee LE 2}(n Best) 

1.7.5. Compute (4), (3s Qs Gs Qs C2): C2). C2), 29). 


1.7.6. Prove that (7) is the number of possible k-member teams in a club with exactly n 


members. For this reason (7) is read n choose k. 


+ 1.7.7. (Invoked in Theorem 2.9.2, Example 8.2.9, Theorem 6.2.3.) Prove that for all 


non-negative integers n, 
(a+b)" = ~ (;) ae 
k=0 ‘ 


(Since a+b contains two summands, it is called a binomial, and the expansion of (a+ 6)” 
is called the binomial expansion, with coefficients a) being called by yet another name 
in this context: binomial coefficients. ) 
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1.7.8. Express each of the following as a + b\/2 for some integers a, b: 
i) V2 — 1, (V2 — 1) (V2 = 1), (V2 = es (V2 - Ps 
ii) Write each of the five expressions in the previous part in the form /¢ — Vd for 
some positive integers c, d. 
(iii)* Do you see a relation between c and d for each expression in the previous part? Is 
there a general rule? Can you prove it? 
1.7.9. Prove that for all positive integers n, }7_9(—1)* (7%) = 0. Compute>*7_9(—1)* (7) 
in case n = 0. 


1.7.10. Prove that for any non-negative integer k, 


YG +YG+2)-- G+) = MEV F eee) 


j=l 


n 


(Hint: induction on n or instead use Exercises 1.7.2 and 1.7.4.) 


1.7.11. Use Exercise 1.7.10 to get succinct simplifying formulas for 


n 


DA LIG+D, DIGH+DG+2), DIG+ DG +2)G +9). 


j=l j=l 
i) Note that 7? = j(j +1) — j. Use the simplifications from above to prove that 
Se 2 n(n+1)(2n+1) 
j=alJ = 6 ; 
ii) From 7? = j(j +1)(j + 2) — 37? — 27 = 9G +1)(G +2) — 37(7 +1) +f develop the 
formula for }7_4 a 
iii) Mimic the previous work to develop the formula for Sa 54. 
1.7.12. Prove that for all non-negative integers n and all k = 0,1,...n, 9) < ae 


1.7.13. Fix a positive integer k. Prove that there exists a positive number C’ such that 
for all sufficiently large integers n, Cn® < eae 


1.7.14. Give reasons why we should have (i) = 0 for n < k or if either k or n is negative. 


1.7.15. Let d be a positive integer. This is about summing entries in Pascal’s triangle 
along the dth northwest-southeast slanted column: Prove by induction on n > O that 


n d+k\ _ (d+n+1 
Ee Coe) 
*1.7.16. Prove that for all non-negative integers n, 
n 9 2 n-1 9 2 
n n 
Oe | Se 2° 
(> Ga) -1=2 (5 Get)=") 
k=0 k=0 


and 
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For notation’s sake you may want to label E, = )°y"q (5,,)2* and On = r7% 9 (ea) oe 
where €n, On are the largest integers such that 2e, <n and 20, +1 <n. The claim is then 
that for all n > 0, E3, —1 = 203, and E%,,, +1 = 203,,,. (Hint: use the definition 
of (2) to rewrite EF, in terms of E,_1,On—1. Proceed with induction.) 


Chapter 2: Concepts with which we will do mathematics 


In this chapter we introduce abstract structures and we examine them on familiar 
notions of numbers and functions from high school mathematics. 'The development of 
abstract tools enables us to analyze with common proofs rather than separately in particular 
all familiar and also many non-yet familiar number structures. This abstraction is akin to 
abstracting the concrete calculations 07, 0.17, 0.27, 0.257, 0.37, 17, 1.17,... into the squaring 
function f(z) = x7; while at first it may be hard to grasp x”, by this point in your 
mathematical career you are able to manipulate this function in many ways with ease. 
By the end of this chapter you will similarly be at ease with the abstracted notions of a 


function or of a number field, and more. 


2.1 Sets 


What is a set? Don’t we already have an idea of what a set is? The following 
informal definition relies on our intuitive idea of a set while making precise some notation 
and vocabulary of membership. 


Definition 2.1.1. A set is a collection of objects. These objects are called members or 
elements of that set. If m is a member of a set A, we write m € A, and also say that A 
contains m. If m is not a member of a set A, we writem ¢ A. 


The set of all polygons contains triangles, squares, rectangles, pentagons, and so on. 
The set of all polygons does not contain circles or disks. The set of all functions contains 


the trigonometric, logarithmic, exponential, constant functions, and so on. 


Examples and notation 2.1.2. 
(1) Intervals are sets: 
(0,1) is the interval from 0 to 1 that does not include 0,1. From the context you 
should be able to distinguish between the interval (0,1) and a point (0,1) in the 
plane. 
More generally, below we take real numbers a and b with a < b. 
(a, b] is the interval from a to b that includes 6 but not a. 
(a, b) is the interval from a to 6 that includes a but not b. 
(a, b] is the interval from a to 6 that includes a and b. 
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(a, 00) is the interval of all real numbers strictly bigger than a. 


) 
[a, 00) is the interval of all real numbers bigger than or equal to a. 


oe) 
(—oo, b) is the interval of all real numbers strictly smaller than b. 

(—oo, b] is the interval of all real numbers smaller than or equal to b. 

(—oo, 00) is the set of all real numbers. 

(0,00) is the set of all positive real numbers. 

[a,a] is the “interval” consisting of precisely one number a. 

(a, a), (a,a],{a,a) are the “interval” consisting of no numbers. 

The set with no elements is called the empty set and is denoted @ or {}. 

The set {0} is not empty because it contains the empty set. 

Descriptively we can say for example that a set A is the set of all prime numbers. 
The description may be more loaded with symbols: A is the set of all real numbers x 
with the property that x? > 42% — 3. (Use the quadratic formula to prove that A 
consists of the numbers that are strictly smaller than 1 or strictly larger than 3.) 
We can list the elements and surround them with curly braces to define a set: 

(i) {1, 2,3} is the set consisting of precisely 1,2 and 3. 

(i) {1,2,3,2} is the set consisting of precisely 1,2 and 3. Thus {1,2,3,2} = 
11,2) 3}. 

(iii) {blue, “hello” 5} is the set consisting precisely of the color blue, of the word 
“hello”, and of number 5. 

(iv) {1,2, {1,2}} is the set consisting of precisely of numbers 1 and 2 and of the 
set {1,2}. This set has exactly three distinct elements, it is not the same as {1,2}, 
and it is not the same as {{1,2}}. 

When the list of elements is not small enough for reasonable explicit listing but 


” 


the pattern of elements is clear, we can start the list and then add “, ...” when 
the pattern is clear: 

(i) {1,2,3,..., 10000} is the set of all positive integers that are at most 10000. 
(ii) {1,4,9,...,169} is the set of the first 13 squares of integers. 

(iii) {1,2,3,...} is the set consisting of all positive whole numbers. This set is 
often denoted by N. To be absolutely clear, we will write N* for {1,2,3,...} and 
No. for: 40:12, 3)-4¢2)+ 

Warning: {3,5,7,...} or {3,5,7,...,101} could stand for the set of all odd primes 
(up to 101), or possibly for the set of all odd whole numbers strictly greater 
than 1 (up to 101). Avoid ambiguities: write more elements, or write an explicit 
description of the elements instead. 

The set of all whole numbers is written Z, the set of all rational numbers is writ- 
ten Q, the set of all real numbers is written R, and the set of all complex numbers 
is written C (complex numbers are defined in Section 3.9, and until then do not 
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worry when “C” appears in the text). The set of all non-negative real numbers 


equals [0, 00), and we also write it as Ro. 
(8) We can define sets propositionally: if P is a property, then the set 


{x :P(x)} or {x € A: P(x)} 
consists of all x (or x € A) for which P holds. Here are some explicit examples: 
(i) {a € RR: x? =z}, and this happens to be the set {0, 1}. 
(ii) {2 € R: x > 0 and z < 1}, and this happens to be the interval (0, 1). 
(iii) Q= {$:a,b€ Zand b FO}. 
(iv) The set of all positive prime numbers equals {n € N: n > 1 and if n = pq for 


some integers p,q then |p| = 1 or |q| = 1 }. 
(v) Set A = {x : x is a positive integer that equals the sum of its proper factors}. 
(Elements of A are called perfect numbers.) It is easy to verify that 1,2,3,4,5 
are not in A. But 6 has factors 1,2,3,6, and the sum of the factors other than 
6 equals 6. Thus 6 is an element of A. You can verify that the three numbers 
28,496, and 8128 are also in A. (If you and your computer have a lot of time, 
write a program to verify that no other number smaller than 33 million is in A.) 
(9) Proving that a property P holds for all integers n > no is the same as saying that 
the set A = {n € Z: P holds for n} contains {no,no + 1,n9 + 2,...}. By the 
principle of mathematical induction, P holds for all integers n > no is the same as 
saying that no € A and that n — 1 € A implies that n € A. 


Summary of example sets, and their notation 

f = {}: the set with no elements. 
Lasbce) 1a Ds 25% bs 
{x : x can be written as a sum of three consecutive integers}. 
N: the set of all Natural numbers. Depending on the book, this could be the set of all 
positive integers or it could be the set of all non-negative integers. The symbols below 
are unambiguous: 

No: the set of all non-negative integers; 

N*: the set of all positive integers. 
Z: the set of all integers (“Zahlen” in German). 
Q: the set of all rational numbers (Quotients). 


R: the set of all Real numbers. 


C: the set of all Complex numbers (more about them starts in Section 3.9). 


Just like numbers, functions, and logical statements, sets and their elements can also 
be related and combined in meaningful ways. The list below introduces quite a few new 


52 Chapter 2: Concepts with which we will do mathematics 


concepts that may be overwhelming at first, but in a few weeks you will be very comfortable 
with them. 


Subsets: A set A is a subset of a set B if every element of A is an element of B. In that 
case we write A C B. For example, N‘ C No C ZC QCR. The non-subset relation 
is expressed with the symbol Z: R ZN. 

Every set is a subset of itself, i.e., for every set A, A C A. 


The empty set is a subset of every set, i.e., for every set A, J C A. 

If A is a subset of B and A is not equal to B (so B contains at least one element that 
is not in A), then we say that A is a proper subset of B, and we write A ¢ B. For 
example, N* CN) CZCQER. 


Equality: Two sets are equal if they consist of exactly the same elements. In other words, 
A= B if and only if AC Band BCA. 


Intersection: The intersection of sets A and B is the set of all objects that are in A 


and in B: 
ANB={x:2¢€Aandz ec B}. 


When AN B = 9, we say that A and B are disjoint. 
Union: The union of sets A and B is the set of all objects that are either in A or in B: 
AUB={ax:2E€Aorze Bh. 
Intersections and unions of arbitrary families of sets: We have seen intersections 


and unions of two sets at a time. We can also take intersections and unions of three, 
four, five, and even infinitely many sets at a time. Verify the equalities below: 


(ANB)NC=AN(BNC), 
(AUB)UC =AU(BUC), 

(AN B)N(CND) =AN(BNCND), ete 
(AUB) U(CUD) =(AU(BUC))UD, ete 


(Verification of the first equality above: Let x € (AN B) NC. This holds if and only 
if x € AN B and zx € C, which holds if and only if x € A,x € B and x € C, which in 
turn holds if and only if z € A and z € (BNC), ie., if and only ifr €e AN(BNC).) 
Thus having established that parentheses above are irrelevant, we simply write the 
four sets above as AN BNC, AUBUC, ANBNCND, AUBUCUD, respectively. 


More generally, given sets A;, Ao,..., An, we write 


() Ax = ALN A2N- ++ An = {a: a € Ag for all he ys 
k=1 
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n 
U Ay = Ay UAgQU-:-UA, = {a: Jk =1,...,n such that a € Ax}. 
k=1 
The k in the subscripts are referred to as indices of unions or intersections. The 
indices can be taken from arbitrary sets, even infinite ones. We call such sets index 
sets. Notationally, if J is an index set, we write 


() An = {a:a@€ Ag for all k € I}, 


kel 
UJ Ax = {a: there exists k € I such that a € Ax}. 
kel 
When J = {1,2,...,n}, then the intersections and unioins in the two lines above are 


the same as those in the previous display. 


When J is the empty index set, one can argue as for empty sums in Section 1.5 that 


J Ar = 9, 


keo 
i.e., the empty union is that set which when unioned with any other set returns that 
other set. The only set which satisfies this property is the empty set. Similarly, 
the empty intersection should be that set which when intersected with any other set 
returns that other set. However, this empty intersection depends on the context: when 
the allowed other sets vary over all subsets of a set X, then the empty intersection 
equals X. We return to this theme in Section 2.5. 


Complement: The complement of A in B is 
B\A={beEB:b¢A}. 


Some authors write B — A, but that has another meaning as well: B— A: {b-—a:be 
Bandaeé A}. Always try to use precise and unambiguous notation. 


We often have an implicit or explicit universal set that contains all elements of our 
current interest. Perhaps we are talking only about real numbers, or perhaps we are 
talking about all functions defined on the interval [0, 1] with values being real numbers. 
In that case, for any subset A of the universal set U, the complement of A is the 
complement of A in U, thus U \ A, and this is denoted as A°. 
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Summary notation and vocabulary 
a € A: ais an element of a set A. 
ACB: Aisa subset of set B; every element of A is an element of B. 
ACB: AC Band AF B; Aisa proper subset of B. 
A=B:ACBand BCA. 
AQ B: the set of all elements that are in A and in B. 
A and B are disjoint: AN B = 0. 
AUB: the set of all elements that are either in A or in B. 
A \ B: the set of all elements of A that are not in B. 
A®: the set of all elements in the universal set that are not in A. 


Example 2.1.3. We prove that Z = {8m +4n:m,n € Z}. Certainly for any integers m 
and n, 3m + 4n is also an integer, so that {83m+4n:m,n € Z} C Z. Now let x € Z. Then 


x=1-xe=(4-3)-2 = 3(-2) 4+ 4a, 


so that x € {8m+4n: m,n € Z}, whence Z C {3m + 4n: m,n € Z}. Since we already 
proved the other inclusion, the proof is done. 


Example 2.1.4. We prove that A = {6m + 14n: m,n € Z} equals the set B of all even 
integers. Certainly for any integers m and n, 6m + 14n is an even integer, so that A C B. 
Now let « € B. Then z is even, so x = 2n for some integer n. Write 


v= 2n = (14-2-6)n = 6(—2n) + 14n, 


so that x € {6m+14n: m,n € Z} = A. Thus B C A. Together with the first part this 
implies that A = B. 


Example 2.1.5. The complement in Z of the set of even integers is the set of odd integers. 
The complement in Q of the set of even integers contains many more elements than odd 


integers. For example, it contains 5, z, rye 


Example 2.1.6. If A C C, then C \ (C'\ A) = A. In other words, the complement of the 


complement of A is A. 


Proof. Let x € A. Then z is not in the complement C' C A of A in C. Since x € C and not 
in the subset CC A of C, it follows that x € C\(C'\ A). This proves that A C C\(C\ A). 
Now let x € C\(C'\ A). Then x € C and z is not in C\ A. Thus necessarily x € A. 


This proves that C\(C \ A) C A. 
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Example 2.1.7. Let A and B be subsets of C. Then A C B if and only if C\ BC C\A. 


Proof. (=) Suppose that A C B. Let x € C\ B. Then « € C and z is not in B. Since 
A C B, then necessarily x is not in B. This proves that x € C C B. Since x was arbitrary 
in C \ B. it follows that C\ BC C\A. 


(<) Suppose that C\ B C C’\ A. Then by the previous paragraph, C \ (C \ A) C 
C \ (C \ B), and so by Example 2.1.6, A C B. 


Example 2.1.8. For each i € N*, let A; = [i,00), B; = {i,i+1,1+2,14+3,...}, and 
C; = (1,7). Think through the following: 


() Ax = 9, () Be =9, () Ce = (-1,1), 


keN keEN keEN 
LJ Ar = [1, 00), U B, = N*, @ Ch =k. 
kEN ken keN 


Example 2.1.9. For each real number r, let A, = {r}, B, = [0, |r|]. Then 


() Ar =9, (|) Be=40}, J 4, =R, lB. -='[0;,.60): 


reER reER reER rER 


Set operations can be represented with a Venn diagram, especially in the presence 
of a universal set U. Here is an example: 


A B \|U 


On this Venn diagram, sets are represented by the geometric regions: A is the set 
represented by the left circle, B is represented by the right circle, AM B is the part of the 
two circles that is both in A and in B, AU B is represented by the region that is either in 
A or in B, A \ B is the left crescent after B is chopped out of A, etc. (There is no reason 
why the regions for sets A and B are drawn as circles, but this is traditional.) 


Sometimes we draw a few (or all) elements into the diagram. For example, in 
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we read that A = {z,y,t}, AN B = {t}, et cetera. 
Two disjoint sets A and B are represented by a Venn diagram as follows: 


OC 


Theorem 2.1.10. For all sets A, B,C, we have AN (BUC) = (AN B)U(ANC). 


Proof. With the Venn diagram below, B UC is the region filled with either horizontal 
or vertical lines, A is the region filled with the southeast-northwest slanted lines, and so 
AN (BUC) is the region that has simultaneously the southeast-northwest slanted lines 
and either horizontal or vertical lines. Also, AMC is the region that has horizontal and 
southeast-northwest slanted lines, 4 B is the region that has vertical and southeast- 
northwest slanted lines, so that their union (AN B)U(ANC) represents the total region of 
southeast-northwest slanted lines that either have horizontal or vertical cross lines as well, 
which is the same as the region for AN (BUC). 


We prove this also algebraically. We have to prove that AN( BUC) C (ANB)U(ANC) 
and (AN B) U(ANC) CAN(BUC). 

Let x be arbitrary in AM (BUC). This says that x € A and x € BUC, and the 
latter says that either x € B or x € C. But then either x € (AN B) or x € (ANC), so 
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that « € (AN B)U(ANC). This proves one inclusion. 

Now let x be arbitrary in (AN B) U(ANC). This says that either x € AN B or 
x € ANC, which in turn says that either x is in A and in B or else that x is in A and in 
C.. In any case, x € A, and either x € B or x € C, so that x € A and x € BUC , which 
finally says that x € AN (BUC). This proves the other inclusion. 


Exercises for Section 2.1 
2.1.1. Prove by induction on n that a set with n elements has exactly 2” distinct subsets. 
2.1.2. Prove that a set with n elements has (2) subsets with exactly 7 elements. 


2.1.3. Assume that A and B are disjoint subsets of U. For each part below, draw a Venn 
diagram with A and B in U, and shade in the region described by the set: (i) U \ B, 
(ii) ANB, (iii) U\ (U\ A), (iv) (U\ A)N(U \ B), (vy) (UN B) UU \ (BU A)), Wi) 
(BOU)U(B\U), (vii) (UU \ (U \ A)) UB, (viii) (AU B)U(U \ A). 

2.1.4. Prove the following: 


i) {ex €R: 2? = 3} = { V3, -V3}. 


ii) {72 : cE R}=R. 
{x2:2E€R}={xeER:2x>0} = (0,cv). 
iv eee ar as 


) 
) 
iii) 
) 
v) {c >0: « is an even prime number} = {2}. 
vi) 0 is a subset of every set. Elements of ( are green, smart, sticky, hairy, feathery, 
prime, whole, negative, positive... 
vii) {z : x can be written as a sum of three consecutive integers} = {38n: n € Z}. 
viii) If AC B, then ANB=Aand AUB=B. 
2.1.5. Let U = {1,2,3,4,5,6}, A= {1,3,5}, and B = {4,5,6}. Find the following sets: 
i) (A\ B) U(B\ A). 
ii 
iii 


lv 


) 
i) 
i) 
) 
v) 
i) A 
i) B 
i) 


o—™~ 


UN A)UUNB). 
\(A\ B). 
\(B\ A). 

viii) {A} N {B}. 


vl 


vil 
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2.1.6. Let A,B CU. 
i) Prove that there exist at most 16 distinct subsets of U obtained from A, B,U by 
intersections, unions, and complementation. 
ii) If A and B are disjoint, prove that there exist at most 8 such distinct subsets. 


) 

iii) If A = B, prove that there exist at most 4 such distinct subsets. 

iv) If A= B =U, prove that there exist at most 2 such distinct subsets. 
) 


v) If A= B=U =9, prove that there exists at most 1 such subset. 


2.1.7. Let A,B,C CU. Prove the following statements: 
i) (ANC)\B=(A\B)N(C\B). 
ii) (A\ B)U(B\ A) = (AUB) \ (ANB). 
iii) (AN B) UU \ (AUB)) = (U\ (A\ B)) \ (B\ A). 
) 
) 


1V 


iv) U\ (A\ B)=(U\ A)UB. 
v) IfU=ANB, then A= B=U. 
2.1.8. Let A, BCU. 
i) Prove that (U \ A)N (U\ B) =U \ (AUB). (The intersection of the complements 
is the complement of the union.) 
ii) Prove that (U \ A) U(U \ B) =U \ (ANB). (The union of the complements is the 
complement of the intersection.) 


2.1.9. Compute {] (-1/k,1/k), (| [-1/k,1/k], () {-1/k, 1/k}. 


keNt keNt keNt 
2.1.10. Compute |) (—1/k,1/k), (J [-1/k,1/k], LJ {-1/k, 1/k}. 
keNt keNt keNt 


2.2 Cartesian product 


The set {a,b} is the same as the set {b,a}, as any element of either set is also the 
element of the other set. Thus, the order of the listing of elements does not matter. But 
sometimes we want the order to matter. We can then simply make another new notation 
for ordered pairs, but in general it is not a good idea to be inventing many new notations 
and concepts; it is better if we can reuse and recycle old ones. We do this next: 


Definition 2.2.1. An ordered pair (a,b) is defined as the set {{a}, {a, b}}. 


So here we defined (a, b) in terms of already known constructions: (a,b) is the set one 
of whose elements is the set {a} with exactly one element a, and the remaining element of 
(a, b) is the set {a, b} that has exactly two elements a,b if a 4 b and has exactly one element 
otherwise. Thus for example the familiar ordered pair (2,3) really stands for {{2}, {2,3}}, 


(3,2) stands for: {{3};{2,3}), and (2,2) stands for 4 {2},{2,2\) =f{fo} oll = {fort 
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Theorem 2.2.2. (a,b) = (c,d) if and only ifa =c and b =d. 


Proof. [RECALL THAT P © Q IS THE SAME AS P > Q AND P <= Q. THUS THE PROOF 
CONSISTS OF TWO PARTS. | 

Proof of =: Suppose that (a,b) = (c,d). Then by the definition of ordered pairs, 
{fat {a,b)) = {{c},fad}}. Ifa = 6; this. says that {{a\) =. {{el.{ed}}) so that 
{{c}, {c,d}} has only one element, so that {c} = {c,d}, so that c = d. But then 
{{a}, {a,b}} = {{c}, {c, d}} is saying that {{a}} = {{c}}, so that {a} = {c}, so that 
a=c. Furthermore, b = a = c = d, which proves the consequent in case a = b. Now 
suppose that a 4 b. Then {{a}, {a,b}} = {{c}, {c,d}} has two elements, and so c ¥ d. 
Note that {a} is an element of {{a}, {a,b}}, hence of {{c}, {c,d}}. Thus necessarily either 
{a} = {c} or {a} = {c,d}. But {a} has only one element and {c,d} has two (since c # d), 
it follows that {a} = {c}, so that a = c. But then {a,b} = {c,d}, and since a = ¢, it follows 
that b = d. This proves the consequent in the remaining cases. 

Proof of <: If a = c and b = d, then {a} = {c} and {a,b} = {c,d}, so that 
{{af, {a,b} } = {te}, te ah}. 


Note that by our definition an ordered pair is a set of one or two sets. 


Definition 2.2.3. For any sets A and B, the Cartesian product A x B of A and B is 
the set {(a,b): a € A and b € B} of all ordered pairs where the first component varies over 
all elements of A and the second component varies over all elements of B. 


In general, one can think of A x B as the “rectangle” with A on the horizontal side 
and B on the perpendicular side. 

Say, if A has 4 elements and B has 3 elements, then A x B is represented by the 
12 points in the rectangle with base consisting of elements of A and height consisting of 
elements of B as follows: 


eo—__e_____e____® 


A 


If instead A and B are intervals as above, then A x B is the indicated rectangle. 
When A and B extend infinitely far, then A x B is correspondingly a “large” rectangle: 


The familiar real plane is the Cartesian product R x R. 


(The three-dimensional space can be written as the Cartesian product R x (R x 


R) or the Cartesian product (R x R) x R. In the former case we write elements in the 
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form (a, (b,c)), and in the latter case we write them in the form ((a,b),c). Those extra 
parentheses are there only for notation and to slow us down, they serve no better function, 
so by convention we write elements simply in the form (a, }, c).) 


Exercises for Section 2.2 

2.2.1. How many elements are in —) x R? 

2.2.2. Prove that A x (BUC) =(Ax B)U(AxC). 

2.2.3. Prove that A x (BNC) =(Ax B)N(AxC). 

2.2.4. Prove that (ANC) x (BND) =(Ax B)N(C x D). 

2.2.5. Give examples of sets A, B, C, D showing that (AUC) x (BUD) 4 (Ax B)U(Cx D). 


2.2.6. Let A have m elements and B have n elements. Prove that A x B has mn elements. 
Prove that A x B has 2™” subsets. 


2.3. Relations, equivalence relations 


In this section we introduce relations in a formal way. Most relations that we even- 
tually analyze are of familiar kind, such as <, <, is cousin, taller than, has same birth date, 
et cetera, but we get more structure with a formal approach. 


Definition 2.3.1. A relation on A and B is any subset of A x B. 

A relation on A is a relation on A and A, i.e., a subset of A x A. 

We can give relation a name, such as R, and in place of “(a,b) € R” we alternatively 
write “aRb”. (We prefer to write “3 < 5” rather than “(3,5) €<”.) 


Examples 2.3.2. 


(1) Some relations on R are <,<,=, >, >. We can write (1,2) € <, or more familiarly, 


1 <2. Asa subset of R x R, < consists of all points on or above the line y = x. 
This relation can be drawn (and read off) easily. 


(2) Draw anything in R x R. That defines a relation on R (which most likely cannot 
be expressed with a formula). The relation R = {(a,b): a,b € R and a? <b+1} 
is drawn as the set of all points (a, y) above the parabola y = 2? — 1. 

(3) The following are all the possible relations on A = {1,2} and B = {a,b}: 
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{(1, a), (1, 6), (2; &)F, 
{(1, 4), (1,6), (2,4); 
{(1, 4), (1, 6)}, 
{(1,@), (2,@)}, 

{(1, 4), (2; 6)}, 

{(1, 6), (2, @)}; 

{(1, 6), (2; 6)}, 

{(2, 4), (2, 6)}, 

{(1, @)}; 

{(1, b)}, 

{(2,a)}; 

{(2, b)}, 


—— 
— 


(4) A relation can be known by more than one name. Among the students in a typical 
college classroom, the relations cousin and aunt are identical, i.e., no one student 
is a cousin or aunt of another, so this relation equals {}, the empty relation. 


Definition 2.3.3. Let R be a relation on A. 
(1) R is reflexive if for alla € A, aRa. 
(2) R is symmetric if for all a,b € A, aRb implies bRa. 
(3) R is transitive if for all a,b,c € A, if aRb, bRc, then aRc. 


(4) R is an equivalence relation if it is reflexive, symmetric and transitive. 


Examples 2.3.4. 


(1) < on R is reflexive and transitive but not symmetric. 


< on R is transitive but not reflexive or symmetric. 


= on any set A is reflexive, symmetric, and transitive. 


(2) 
(3) 
(4) Being a cousin is symmetric and neither reflexive nor transitive. 
(5) Being taller than is ... 

(6) 


Let A = {u,v}. Any equivalence relation on A needs to contain (u, u) and (v, v) 
in order to achieve reflexivity. The relation 


{(u, u); (v0) } 


is reflexive, symmetric and transitive. The relations 


{(u,u), (u,v), (u,u)} and {(u, u), (v, v), (v, u)} 
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are reflexive and transitive but not symmetric. The relation 


{(u, u); (v, 0); (u,v); (v, w)F 


is an equivalence relation. Thus there are exactly two equivalence relations on A. 


Definition 2.3.5. Let R be an equivalence relation on a set A. For each a € A, the set 
of all elements b of A such that aRb is called the equivalence class of a. We denote the 
equivalence class of a with the shorthand [a]. 


For example, if R is the equality relation, then the equivalence class of a is {a}. If 
R= Ax A, then the equivalence class of any a is A. If A is the set of all students in 
Math 112 this year, and aRb if students a and 6 are in the same section of Math 112, then 
[a] is the set of all students that are in the same section as student a. 


Theorem 2.3.6. Let R be an equivalence relation on a set A. Two equivalence classes 


are either identical or they have no elements in common. 


Proof. Let a,b € A, and suppose that their equivalence classes have an element in common. 
Call the element c. 

We now prove that the equivalence class of a is a subset of the equivalence class 
of b. Let d be any element in the equivalence class of a. Then aRd, aRc and bRc imply by 
symmetry that dRa and cRb, so that by transitivity dRc. Then dRc, cRb and transitivity 
give dRb, so that by symmetry bRd, which says that d is in the equivalence class of b. Thus 
the equivalence class of a is a subset of the equivalence class of b. 


A symmetric proof shows that the equivalence class of b is a subset of the equivalence 


class of a, so that the two equivalence classes are identical. 


Remark 2.3.7. What this says is that whenever R is an equivalence relation on a set A, 
then every element of A is in a unique equivalence class. Thus A is the disjoint union of 
distinct equivalence classes. Conversely, if A = Uje;A; where the A; are pairwise disjoint, 
define R C A x A as (a,b) € R precisely if a and b are elements of the same A;. Then R is 
an equivalence relation: reflexivity and symmetry are obvious, and for transitivity, suppose 
that a and 6 are in the same A; and 6 and © are in the same A;. Since A; and A; have the 
element b in common, by the pairwise disjoint assumption necessarily 2 = j, so that a and 
c are both in A;. Thus R is an equivalence relation. 
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Example 2.3.8. Let A = {1,2,3,4,5}. The writing of A as {1,2} U {3,4} U {5} makes 
the following equivalence relation on A: 


{(1, 1), (1, 2), (2; 1), (2,2), (3, 3), (8; 4); (4,3), (4 4), (5, 5)F. 


This means is that counting all the possible equivalence relations on A is the same as 
counting all the possible writings of A as unions of pairwise disjoint subsets. (In contrast, 
the number of all possible relations on a set A equals the number of subsets of A x A.) 


Important example 2.3.9. Let n be a positive integer. Let R be the relation on Z given 
by aRb if a — b is a multiple of n. This relation is called congruence modulo n. It is 
reflexive because for every a € Z, a — a = 0 is an integer multiple of n. It is symmetric 
because for all a,b € Z, if aRb, then a— b = x-n for some integer x, and so b—a = (—2)-n, 
and since —2 is an integer, this proves that bRa. Finally, this relation R is transitive: let 
a,b,c € Z, and suppose that aRb and bRc. This means that a—b = x-n and b—c = y-n for 
some integers x and y. Then a—c = a+(—b+b)—c = (a—b)+(b-c) = a-nt+y-n = (xt+y)-n, 
and since x+y is an integer, this proves that aRc. Thus R is an equivalence relation. If aRb 
for this relation R, we say that a is congruent to b modulo n, or that a is congruent 
to b mod n. (Normally in the literature this is written as a = bmod n.) We denote the 
set of all equivalence classes with Z/nZ, and we read this as “Z mod n Z”. This set 
consists of [0], [1], [2],...,[n — 1], [n] = [0], [pn + 1] = [1], et cetera, so that Z/nZ has at 
most n equivalence classes. Since any two numbers among 0,1,...,n — 1 have difference 
strictly between 0 and n, it follows that this difference is not an integer multiple of n, so 
that [0], [1], [2],...,|n—1] are distinct. Thus Z/nZ has exactly n equivalence classes. Two 
natural lists of representatives of equivalence classes are 0,1,2,...,n —1 and 1,2,...,n. 
(But there are infinitely many other representatives as well.) 

For example, modulo 12, the equivalence class of 1 is the set {1, 13, 25,37,...} U 
{—11, —23, —35,...}, and the equivalence class of 12 is the set of all multiples of 12 (in- 
cluding 0). 

In everyday life we use congruence modulo 12 (or sometimes 24) for hours, modulo 
12 for months, modulo 7 for days of the week, modulo 4 for seasons of the year, modulo 3 
for meals of the day ... 

There are exactly two equivalence classes for the congruences modulo 2: one consists 
of all the even integers and the other of all the odd integers. There is exactly one equivalence 
class for the congruences modulo 1: all integers are congruent modulo 1 to each other. For 
the congruences modulo 0, each equivalence class consists of precisely one element. 
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Example 2.3.10. (Construction of Z from No.) Consider the Cartesian product No x No. 
Elements are pairs of the form (a,b), with a,b € No. If a,b, c,d € No, we write (a, b) R(c, d) 
ifa+d=b+c. Thus R is a relation on No x No. (Certainly you are familiar with Z, 
in which case you may want to think of this relation simply saying that (a,b)R(c,d) if 
a—b=c-—d. The problem is that in No we may not be able to subtract b from a and still 
get an element from Np.) 

(1) R is reflexive: because for all (a,b) € No x No, by commutativity of addition, 
a+b=b+a, so that by definition of R, (a,b) R(a,b). 

(2) R is symmetric: if (a,b)R(c,d), then by definition a + d = b+, so that by 
commutativity of addition, d+ a= c+b, and by symmetry of the = relation, 
c+b=d-+a. But by definition this says that (c,d) R(a,b). 

(3) R is transitive: if (a,b)R(c,d) and (c,d)R(e, f), then by definition a+d=b+c 
and c+ f =d+e. It follows that (a+ d)+(c+f) = (b+c)+(d+e). By 
associativity and commutativity of addition, (a+ f) + (c+d) = (b+e)+(c+d), 
and by cancellation then a + f = b+ e, which says that (a,b)R(e, f). 

Now we define a set Z to be the set of equivalence classes for this relation. Every element 
(a,b) of No x No is in an equivalence class: if a > b, then (a,b) R(a—b,0), and if a < b, then 
(a, b)R(0,b—a). Thus for each (a,b) € No x No, there is an element in the equivalence class 
of (a,b) of the form (0, e) or (e,0) for some e € No, and it is left for the reader to verify that 
this e is unique. In Definition 3.5.2 and beyond we identify the set of equivalence classes 
with the usual integers in Z as follows: the equivalence class of (e,0) corresponds to the 
non-negative integer e and the equivalence class of (0,e) corresponds to the non-positive 
integer —e. In Section 3.5 we develop the arithmetic on these equivalence classes (and 
hence on Z) from the arithmetic on No. 


Exercises for Section 2.3 

2.3.1. Let A = {a,b} and B = {b, c,d}. 

i) How many elements are in the set A x A? 

ii) How many elements are in the set A x B? 

iii) How many elements are in the set B x B? 

iv) How many relations are there on A? How many relations are there on B? (Recall 
Definition 2.3.1 for the definition of relation.) 

v) How many relations are there on A and B? How many relations are there on B 
and A? 


vi) How many relations are there on AU B and AN B? 
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2.3.2. Let A have n elements and B have m elements. How many distinct relations on A 
and B are there? Prove. 
2.3.3. In each part below, find a relation with the given properties. You may contrive a 
relation on a contrived set A. 

i) Reflexive, but not symmetric and not transitive. 

ii) Reflexive and symmetric, but not transitive. 
iii) Reflexive and transitive, but not symmetric. 


) 
) 
iv) Symmetric, but not reflexive and not transitive. 
v) Transitive, but not symmetric and not reflexive. 
) 


vi) Transitive and symmetric, but not reflexive. 


2.3.4. Let A be a set with 2 elements. Count all equivalence relations on A. Repeat first 
for A with 3 elements, then for A with 4 elements. 


2.3.5. Let A be a set with n elements. Let R be an equivalence relation on A with fewest 
members. How many members does R have? 


2.3.6. Let R be the relation on R given by aRb if a — b is a rational number. Prove that 


R is an equivalence relation. Find at least three disjoint equivalence classes. 


2.3.7. Let R be the relation on R given by aRb if a — b is an integer. 
i) Prove that R is an equivalence relation. 
ii) Prove that for any a € R there exists b € [0,1) such that [a] = [0]. 


2.3.8. Let R be a relation on R x R given by (a,b)R(c, d) if and only if a—c and b—d are 
integers. 
i) Prove that R is an equivalence relation. 
ii) Prove that for any (a,b) € R x R there exists (c,d) € [0,1) x [0,1) such that 
[(a, 6)] = [(e, d)]. 


iii) Prove that the set of equivalence classes can be identified with [0,1) x [0,1). 


iv) For fun: check out the video game Asteroids online for a demonstration of this 
equivalence relation. Do not get addicted to the game. 
2.3.9. Let A be the set of all lines in the plane. 
i) Prove that the relation “is parallel to” is an equivalence relation. Note that the 
equivalence class of a non-vertical line can be identified by the (same) slope of the 
lines in that class. Note that the vertical lines are in their own equivalence class. 


ii) Prove that the relation “is perpendicular to” is not an equivalence relation. 
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} 2.3.10. (Invoked in Section 3.6.) For (a,b), (a’,b’) € Z x Z \ {0}, define (a, b) 2 (a’, 0’) if 
a:b’ =a'-b. 

i) Prove that 2 is an equivalence relation. (Possibly mimic Example 2.3.10.) 

ii) Describe the equivalence classes of (0,1), (1,1), (2,3)? 


iii) Find a natural identification between the equivalence classes and elements of Q. 


2.4 Functions 


Here is the familiar definition of functions: 


Definition 2.4.1. Let A and B be sets. A function from A to B is a rule that assigns to 
each element of A a unique element of B. We express this with “f : A + B is a function.” 
The set A is the domain of f and B is the codomain of f. The range or image of f is 
Image(f) = Range(f) = {b€ B: b= f(a) for someae€ A}. 


But in the spirit of introducing few new notions, let’s instead define functions with 
the concepts we already know. Convince yourself that the two definitions are the same: 


Definition 2.4.2. Let A and B be sets. A relation f on A and B is a function if for all 
a € A there exists b € B such that (a,b) € f and if for all (a,b), (a,c) € f, b=c. In this 
case we say that A is the domain of f, B is the codomain of f, and we write f : A > B. 
The range of f is Range(f) = {b€ B: there exists a € A such that (a,b) € f}. 


Note that this second formulation is also familiar: it gives us all elements of the 
graph of the function: b = f(a) if and only if (a,b) is on the graph of f. We freely change 
between notations f(a) = b and (a,b) € f. 

One should be aware that if f is a function, then f(a) is an element of the range 
and is not by itself a function. (But often we speak loosely of f(a) being a function, such 
as “x? is a function” .) 


To specify a function one needs to present its domain and its codomain, and to 
show what the function does to each element of the domain. 
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Examples 2.4.3. 


(1) 


(10) 


(11) 


A function can be given with a formula. 


For example, let f : R > R be given by f(x) = ree The range is (0, 1]: For all 


xz, 1+2? > 1 with equality when x = 0. Thus f(x) € (0,1). For any y € (0,1), 


1/y >1,s01/y—1>0,soxz= Vy —1 is a positive real number and f(x) = y. 
So indeed the range of f is (0, 1]. 


Here are formula definitions of two functions with domains [0,00): f(x) = \/x and 
g(x) = —V/x. Note, however, that h(a) = +,/x is NOT a function! 


There may be more than one formula for a function, each of which is applied to 
distinct elements of the domain. For example, define f : Nt > Z by 
n=l if n is odd; 
finy=4 3 


=, ibm is.even; 


Let f : N* > R be given by the description that f(n) equals the nth prime. 
By Euclid’s theorem (proved on page 33) there are infinitely many primes so that 
f is indeed defined for all positive integers. We know that f(1) = 2, f(2) = 3, 
f (3) =5, and with computer’s help I get that f(100) = 541, f(500) = 3571. There 
is no algebraic formula for the nth prime. 


For any set A, the identity function id4 : A > A takes each « to itself. 


Let b € B. A function f : A> B given by f(a) = 6 for all a is called a constant 


function. 


The constant function f : R > R given by f(x) = 1 for all x is not the identity 


function. 


A function may be presented by a table. Here is an example. 


_| f(x) 
fi- 21 
D2) <7 
3°| 2 


A function may be presented in a pie chart, histogram, with words, in a weather 


map... 


A function f : N* > R can be given recursively, such as the Fibonacci numbers 
f(1) =1, f(2) = 1, and for alln > 2, f(n+1) = f(n)+f(n—1). See Exercise 1.6.30 
for more on these numbers. 


A function can be given by its graph: 
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From this particular graph we surmise that f(0) = 0, but with the precision of 
the drawing and our eyesight it might be the case that f(0) = 0.000000000004. 
Without any further labels on the axes we cannot estimate the numerical values 
of f at other points. Typically the graph should be filled in with more information. 
The arrows on the graphs indicate increasing values. 


(12) A function may be presented with Venn diagrams and arrows: 


Remark 2.4.4. Two functions are the same if they have the same domains, the same 
codomains, and if to each element of the domain they assign the same element of the 


codomain. 


For example, f : R > R and g: R — (0,00) given by f(x) = 2? and g(x) = 2? are 


not the same! On the other hand, the functions h,k : R > R given by h(x) = |a| and 
k(x) = Vax? are the same. 


Notation 2.4.5. It is common to not specify the domain, in which case the domain is 
implicitly the largest possible subset of R on which the function is defined. For example, the 
domain of f(x) = 4 is the set of all non-zero real numbers, and the domain of f(z) = \/Z is 
the set of all non-negative real numbers. (After we introduce complex numbers the domain 
will implicitly be the largest possible subset of C on which the function is defined, see 
page 142.) 

Sometimes instead we want to take smaller domains than possible: 
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Definition 2.4.6. Let f : A > B bea function and let C' be a subset of A. The function 
g:C > B defined by g(c) = f(c) for allc € C C A is called the restriction of f to C or 
f restricted to C. It is commonly written as g = f\c. 


For example, let f : R > Rso be given as f(x) = Vx?. Then f restricted to Rso 
equals the identity function and f restricted to {—2,2} equals a constant function. 


Definition 2.4.7. Let A,B,C,D be sets with BCC. Iff:A—-> Bandg:C- D, 
then the composition g composed with f is a function g of : A > D such that for all 


aé A, (go f)(a) =g9(f(a)). 
If f(x) = (2+ 1)? and g 


( 
2) =9(f(2) = (@t 2) (e417 == (e411, 
=fiet =) Se = 141)? =2°, 


x) =xz° —1, then 


and these three outcomes are distinct. For example, when plugging in x = 1, we get values 


(go f)(1) = 63, (fog)(1) =1, 9A) fC) = 0. 


Remark 2.4.8. It is common to write f? = fof, f? = f?of = fofof, etc. Some 
exceptions to this notation are established in trigonometry for historical reasons: “sin? (ax)” 


20 


stands for “(sinz)*” and not for “sin(sinx)”. If we discovered trigonometry today, then 


sin? would have the latter meaning. 


Example 2.4.9. Let f(x) = /z, and g(x) = x”. The domain of f is Ryo, and the domain 
of g is R. It is possible to compose go f : Ryo > R to obtain (go f)(x) = 2, but when 


composing f and g in the other order, (f og)(x) is not always equal to x. Namely, for any 
negative number zx, (fo g)(x) = —a. 
We just demonstrated that in composing two functions, the order matters. 


Definition 2.4.10. A function f : A > B is injective (or one-to-one) if for alla,a’ € A, 
f(a) = f(a’) implies a = a’. In other words, f is injective if whenever two things map via 
f to one object, then those two things are actually the same. 

A function f : A > B is surjective (or onto) if for all b € B, there exists a € A 
such that f(a) = b. In other words, f is surjective if every element of the codomain is 
mapped onto via f by some element of the domain. 

A function f : A > B is bijective if it is injective and surjective. 


For example, the identity function is always bijective. Constant functions are in- 


jective when the domain consists of one element, and are surjective when the codomain 
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2 


consists of one element. The function f : R — R given by f(x) = 2” is not injective 


because f(—1) = 1 = f(1). It is, however, surjective. The function f : Rso — R given 


by f(z) = Va is injective because the square root function is strictly increasing (more 
about that in Section 2.9), but it is not surjective because —1 is not the square root of 


any non-negative real number. The function f : Rso9 — Rso given by f(x) = x is both 


injective and surjective, thus bijective. 


The following are all the possible functions f : {1,2} > {1,2}, and they are given in 


«| f(z) _| f(x) f(z) x _| f(x) 
1] 1 <2 2 [peo 
2/1 2 2D Drill 2° | 2 


The first and the last are neither injective nor surjective, but the middle two are bijective. 
The following are the eight possible functions f : {1,2,3} — {1,2}: 


tabular form: 


R/S 


In this case no functions are injective, and all non-constant ones are surjective. 
Theorem 2.4.11. The composition of two injective functions is injective. 


Proof. Let f: A— Band g:C > D be injective functions, and suppose that B C C’ so 
that we can compose the two functions. 

If (go f)(a) = (go f)(a’), then g(f(a)) = g(f(a’)). Since g is injective, it follows that 
f(a) = f(a’), and since f is injective, it follows that a= a’. Thus go f is injective. 


Theorem 2.4.12. If f: A— Bandg: B > C are surjective functions, soisgof:A—>C. 
If f and g are both bijective, so is go f. 


Proof. Let c € C. Since g is surjective, there exists b € B such that g(b) = c. Since f is 
surjective, there exists a € A such that f(a) = b. Thus (go f)(a) = g(f(a)) = g(b) =c, so 
that go f is surjective. 


The last statement follows from the first part and Theorem 2.4.11. 


Section 2.4: Functions 71 


Definition 2.4.13. We have seen polynomial functions in Section 1.5: recall that for any 


subset S C R, a function f : S > R is a polynomial function if there exist a non-negative 


integer n and co,Ci,...,Cn € R such that for allx € S, f(x) =cote exten? +++: +en2”. 


A function f : S > R is a rational function if there exist polynomial functions f;, fo : 
S — R such that for all x € S, fo(x) £0 and f(x) = fi(x)/fo(a). 
Similarly there are polynomial and rational functions if all occurrences of “R” above 


are replaced by “Q” or “C”. 


Polynomial and rational functions are a workhorse of analysis. Below are some special 
properties. Further special properties of polynomial functions in greater generality appear 
in Exercise 2.6.13. (The reader has of course encountered trigonometric, exponential, and 
logarithmic functions, which are not polynomial or rational.) 


Definition 2.4.14. Let f :.S — R bea function. A zero or a root of f is any a € S' such 
that f(a) = 0. 


(This meaning of “root” is different from the meaning of “root” in “square root”, 
“cube root” or “100th root” of a number.) 


Theorem 2.4.15. If a polynomial function with real coeffiecients is not constant zero, 
then it has only finitely many roots. Specifically, if f(x) = co + c1@ + Cot? +++» + nx”, 
then the number of roots is at most n. 

The domain of a rational function is the complement of a finite subset of R. 


(The same statement and proof work if the coefficients are complex numbers; complex 
numbers are introduced in Chapter 3.) 


Proof. At least one of c; is non-zero since f(a) is not constant 0. By possibly renaming we 
may assume that c, #0, and by the non-constant assumption, n > 1. Ifn =1, then f has 
only one root, namely —co/ci. Suppose that n > 2.* In general, let a be any root of f. By 
the Euclidean algorithm (Example 1.6.5), there exist polynomial functions q and r such 
that 


f(x) = qa)(@ — a) + r(2), 


* lin = 2, then by the quadratic formula the roots of co + c1x% + cox" are pees eh eA ies There 
are root formulas for cases n = 3,4, but executing them is very time-consuming, they require familiarity with 
complex numbers, the solutions involve sums of cube roots with a few square roots thrown in for good measure, 
and furthermore it can be hard to identify that the ensuing long expression simplifies to a nice root such as 2. 
This, and the existence of computer capabilities, are the reasons that we do not teach such formulas. The formula 
for solutions of cubic polynomials was discovered by Niccolé Fontana Tartaglia (1500-1557) and for quartic ones 
by Lodovico Ferrari (1522-1565). Both formulas were popularized in a book by Gerolamo Cardano (1501-1576). 
There are no formulas for general polynomials of degree n > 5: not only do you and I do not know such a 
formula, but Niels Henrik Abel (1802-1829) and Evariste Galois (1811-1832) proved that no formulas exist using 


only radicals, sums, products, quotients. 
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and the degree of r is strictly smaller than 1, i.e., r(z) is a constant. But if we plug x =a 
into both sides, since a is a root of f, we get that r is the constant zero function, so that 


f(x) = g(a) (@ — a). 


We write q(x) = bo + bia +--+ +bmx™!. By multiplying q(x)(x — a) out we get that 
m=n-—1. By induction, q(x) has at most n — 1 roots. If b is a root of f, then 


0 = f(b) = q(b)(b— a), 


so that either q(b) = 0 of b— a= 0. Thus 6 is either a root of g or b= a. Thus the roots 
of f are a and any of the at most n — 1 roots of q, so that f has at most n roots. This 
proves the first part. 

A rational function is a quotient of two polynomial functions, and the rational func- 
tion is defined everywhere except where the denominator is 0. By the first part this excludes 


only finitely many numbers. 


This theorem is useful in that it assures us that the domain of a rational function 
contains infinitely many points, or even better, that the domain is all except finitely many 
real (or complex) numbers. 

(You are aware that the trigonometric functions sine and cosine have infinitely many 
zeroes and that tangent and cotangent are not defined at infinitely many real numbers. This 
fact, together with the previous theorem, establishes that these four trigonometric functions 
are not polynomial or rational functions. For similar reasons the logarithmic functions are 
not polynomial or rational. We have to work harder to prove that the exponential functions 
are not polynomial or rational.) 


Exercises for Section 2.4 

2.4.1. Fix a positive integer n. Define f : Z > {0,1,2,3,...,n —1} by f(a) is the 
remainder after a is divided by n. In number theory instead of “f(a) = b” one says a 
mod n is b. We can also define a similar (but not the same!) function g : Z > Z/nZ by 
g(a) = [a]. Graph the function f for n = 2 andn=3. 


2.4.2. Define the floor function | | : R — R to be the function such that for all « € R, 
|x| is the largest integer that is less than or equal to x. The range is Z. For example, 
|| = 3, [1] =1, |—1.5] = —2. Graph this function. 


2.4.3. The ceiling function | | : R > R is the function for which [x] is the smallest 
integer that is greater than or equal to x. The range is Z. For example, [7] = 4, [1] = 1, 
[—1.5| = —1. Graph this function. 
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2.4.4. Let A = [—a,a] or A = (—a,a), and let B be a subset of R (or of C). A function 
f : A— B is an odd function (resp. even function) if for all x € A, f(—x) = —f(z) 
(resp. f(—x) = f(x)). Let n be a positive integer, co,c1,..., Cn real (or complex) numbers, 
and f the polynomial function f(x) = cg + cyv@ + cot? +++» +en2". 

i) Suppose that for all even k, c, = 0. Prove that f is an odd function. 

ii) Suppose that for all odd k, c, = 0. Prove that f is an even function. 
2.4.5. Construct a set A and functions f,g: A— A such that fog4#gof. 
2.4.6. Leth: A> B,g:B—>C,and f:C—- D. Prove that fo(goh) =(fog)oh. 


2.4.7. Let f: Rx R—- Rand g:R—- Rx R be given by f(z, y) = x and g(x) = (2,0). 
i) Compute fog and go f. (This means: for each function find the domain and 


codomain, and show what the function evaluates to at each point in the domain.) 
ii) Which among f,g, f°g,g0f are injective, surjective, bijective? 
2.4.8. Let A = {1,2}, B = {2,4}, and C = {3,6,9}. Mark each of the following as a 
function A > B, B+ C, or C7 A. 
i) f = {(2, 6), (4,3)}. 
ii) g = {(9, 1), (6, 2), (3, 2)}- 
il) = {1422 
2.4.9. For each of the following functions, state the domains, codomains, and how they 
are compositions of f,g,h from the previous exercise. 
i) {(2,2),(4,2)}. 
ii) {(9, 4), (3, 2), (6, 2)}- 
iii) {(2,6), (1, 3)}. 
2.4.10. Let A be a set with 3 elements and B be a set with 2 elements. 
i) Count all functions from A to B? 
ii) Count all injective functions from A to B. 
iii) Count all surjective functions from A to B. Repeat for functions from B to A. 
iv) Count all bijective functions from A to B. 
v) Count all functions from A to B that are injective but not surjective. 
vi) Count all functions from A to B that are surjective but not injective. 
vii) Count all functions from A to B that are neither surjective nor injective. 


2.4.11. In each part below, find f : R > R with the specified condition. 
i) f is a bijective function. 


ii) f is neither injective nor surjective. 


ei 
iii) f is injective, but not surjective. 
Mey 


iv) f is surjective, but not injective. 
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2.4.12. The pigeonhole principle states that if n items (such as pigeons) are put into 
m holes with n > m, then at least one hole has more than one item. Let A and B be sets 
with only finitely many elements. 

i) Use the pigeonhole principle to demonstrate that if A has more elements than B, 
then f : A > B cannot be injective. 
ii) Use the pigeonhole principle to demonstrate that if A has fewer elements than B, 
then f : A > B cannot be surjective. 


iii) Use the pigeonhole principle to demonstrate that if A and B do not have the same 
number of elements, then f : A — B cannot be bijective. 
2.4.13. Let A be a set with m elements and B a set with n elements. (To solve this 
problem, you may want to first examine the case m = 1, then m = 2, followed by m = 3, 
or possibly you have to start with small n, after which you will probably see a pattern. 
Once you have the correct pattern, the proof is straightforward.) 
i) Count all functions from A to B? 
ii) For which combinations of m,n are there injective functions from A to B? 
iii) For m,n as in (ii), count all injective functions from A to B. 
iv) For which combinations of m,n are there surjective functions from A to B? 
v) For m,n as in (iv), count all surjective functions from A to B. 
vi) For which combinations of m,n are there bijective functions from A to B? 
vii) For m,n as in (vi), count all bijective functions from A to B. 
2.4.14. Find an injective function f : Nt > N* that is not surjective. Find a surjective 
function g : N* + Nt that is not injective. Compare with parts (ii) and (iv) of the previous 
exercise. Does the pigeonhole principle apply? 
2.4.15. Let f: A> Band g: B->C be functions. 
i) Suppose that go f is injective. Prove that f is injective. 
ii) Suppose that go f is surjective. Prove that g is surjective. 
iii) Give an example of f,g such that go f is injective but g is not injective. 


iv) Give an example of f,g such that go f is surjective but f is not surjective. 


2.4.16. Find functions f,g : R > R such that f is surjective but not injective, g is injective 


but not surjective, and f og is bijective. 
2.4.17. Prove that if f and g are both injective functions, then f o g is also injective. 
2.4.18. Prove that if f and g are both surjective functions, then f o g is also surjective. 


2.4.19. Prove that if f and g are both bijective functions, then f og is also bijective. 
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2.4.20. Suppose f : A > B. We introduce the notation f(S) = {f(x) : « € S} for any 
subset S of A. Let C be an arbitrary subset of A. 
i) Prove f(A) \ #(C) € f(A\C). 
ii) With proof, what condition makes f(A) \ f(C) = f(A\C)? 
2.4.21. For the following polynomial and rational functions, determine the domains 
(largest sets on which the function is defined): 
i) f(w) = 2? — aah. 
ii) f() = ae. 
Gi) Fle )= Soistwecd (Gotcha! We do not know the roots of this denominator.) 


2.4.22. Prove that f : R— R given by f(x) = x? — 1 is surjective and injective. 


2.4.23. Prove that the composition of polynomial functions f,g : R — R is polynomial. 


2.4.24. Prove that the composition of rational functions f,g : R > R is rational. 


2.4.25. A function f : A > B is called invertible if there exists g : B — A such that 
go f =idy and f og =idg. 
i) Suppose that g,h: B > Aare such that gof =id4 =hof and fog =idg = fog. 
Prove that g =h. Such g is called the inverse of f. 
ii) Prove that a function is invertible if and only if it is bijective. 
2.4.26. Let f : R > R be given by f(x) = 2? — 5. Prove that f is invertible, find its 
inverse, and verify that the inverse is not a polynomial function. 


2.4.27. (Lagrange interpolation) Let c1,...,c be distinct real (or complex) numbers. 
For j € {1,2,...,n} set 


(x — c1)(@ — ca) - + - (4 — Gj-1)(@ = Cj-41) -- (@ = en) 


gj(x) = 
; (cj — €1) (Cj — C2) ++ (ej — 7-1) (Cj — C541) > + (Cj — En) 
i) Prove that g; is a polynomial function and that g;(c;) = fy TG 
4 aN 0, otherwise. 
ii) Prove that for any real (or complex) numbers dj,...,d,, there exists a polynomial g 


of degree at most n — 1 such that for alli =1,...,n, g(c;) = dj. 


2.4.28. Let f:N* > Zand g:N* > Q be defined by 
n=l if n is odd; 
= 2 D) 1 2: 
F(n) { =5;. if 2 18 even, 
and for any positive integer with prime factorization p{'---p,* let g(p{'---p,*) = 
pi) . per”. For example, the prime factorization of 1 has all e; equal to 0, so g(1) = 1. 
i) Compute f(1), f(2), f(3), (4), F(5), F(6). 
ii) Compute g(1), 9(2), 9(3), (4), 9(5), 9(6). 
iii) Prove that f is bijective. 
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iv) Assuming/knowing that prime factorization of positive integers is unique up to 
order, prove that g is bijective. 
(This proof that N* and Q* have “the same infinite number” of elements is from the 
article Counting the rationals by Y. Sagher, in American Mathematical Monthly 96 (1989), 
page 823. Another proof of this size equality appears on page 279.) 
2.4.29. When defining a function, one has to pay attention that it is well-defined. Read 
through examples below showing how a definition can go wrong. As a result, the purported 
definition of a function does not define a function at all. 
i) The codomain is not large enough, say for f : {1,2} > {1,2} with f(x) = 2?. 


ii) The function is not defined for all elements in the specified domain, say for f : R > 


R that is the inverse of the squaring function or for f : R > R that returns the 


multiplicative inverse. 

iii) The function does not return an unambiguous value. For example, let f : Z/12Z > 
Z be defined by f({a]) = a. Since [0] = [12], then 0 = f((0]) = f([12]) = 12, but 
the integer 0 is not equal to the integer 12. 

iv) The values may not exist as specified. For example, let f : {1} — Z be defined as 
f (1) is the smallest even integer greater than 4 that cannot be written as the sum 
of two primes. The reason why this function is not well-defined is that according 
to the Goldbach conjecture (see page 13) we do not know whether such a number 
exists. 

v) Let f : {1} - Z be defined as f(1) is the smallest prime number bigger than 
100°. We know from page 33 that such a prime exists, so this f is well-defined 
as a function. However, we do not know its numerical value due to computational 


limitations, which makes this function useless for some purposes. 


2.4.30. Let A be the set of all differentiable functions f : R > R. Define a relation R on 
A by fRg if f(0) = g(0). 
i) Prove that R is an equivalence relation. 
(ii)* Let S be the set of all equivalence classes. Define F': S > R by F([f]) = f(0). 
Prove that Fis a well-defined function, in the sense that if [f] = [g], then F([f]) = 
F([g|). Also prove that F is bijective. 


2.5 Binary operations 
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Definition 2.5.1. A binary operation on a set A is a functiono: Ax A— A. An 
arbitrary element of A x A is an ordered pair (a,b) for a,b € A, and when we plug that 
element into the function o, we should perhaps write o((a,b)), but normally one set of 
parentheses is removed, so we write o(a,b). But depending on the exact form of o, we often 
traditionally write ao b rather than 0o(a,b) (see examples below). 


Binary operations that we have seen are: 
+,-,:,/,xX,0,9,U, or, and, xor, =>, <>. 


Examples and non-examples 2.5.2. 
(1) +,- on N, Z, Q, R (we of course write a + b rather than +(a,b) and a- b or even 
ab rather than -(a, b)); 
(2) Subtraction — on Z, Q, R (but not on N); 
(3) Division / on Q \ {0}, R \ {0} (but not on Q, R, N*, Z\ {0}); 
(4) The additive inverse operation — is not a binary operation on R. (It is unary, 


i.e., it acts on one number at a time.) 


~! is not a binary operation on R \ {0}. (It 


(5) The multiplicative inverse operation _ 
is unary.) 

(6) Let S be a set, and let A be the set of all functions f : S + S. The composition 
(of functions) is a binary operation on A. 

(7) M,U,\ are binary operations on the set of subsets of a set S. 


(8) and and or are binary operations on the set of logical statements. 


Definition 2.5.3. Leto be a binary operation on a set A. An element e € A is an identity 
element for o if for alla € A,ace=a=eoa. 


(In this chapter we use “e” for the identity element; this is a symbol that is unrelated 
to the base of the exponential function as in Definition 7.6.6.) 


Examples 2.5.4. An identity (or is it the identity?) for + on Z,Q,No, Ris 0. An identity 
for multiplication on Z, Q, No, R is the number 1. 


An identity for composition of functions is the identity function 7d (the function with 
id(2) =x forall. 2). 

Subtraction on Z does not have an identity element because if e were an identity 
element, then 1 — e = 1 =e —1, which says that e = 0 and e = 2, which is nonsense. 

If S is a set and A is the collection of all subsets of S, then S is the identity element 
for MN and @) is the identity element for U. The binary operation \ on A does not have an 
identity unless the only element of A is the empty set. 

The theorem below resolves the problem of “an identity” versus “the identity”. 
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Theorem 2.5.5. Let o be a binary operation on A. Suppose that e and f are both 
identities for o. Then e = f. In other words, if an identity exists for a binary operation, it 
is unique. Hence we talk about the identity for o. 


Proof. Since for all a € A, eoa =a, we get in particular that eo f = f. Also, for every 
a€éA,aof=a,henceeof=e. Thuse=eof=f. 


Note: we used symmetry and transitivity of the equality relation. 


Definition 2.5.6. Let o be a binary operation on A and suppose that e is its identity. Let 
x be an element of A. An inverse of x is an element y € A such that roy=e=you. 
To emphasize what the operation is, we may also say that y is a o-inverse of x (or see 
specific terms below). 


Examples and non-examples 2.5.7. 

(1) Let o=+0n Z. Then 0 is the identity element and every element has an additive 
inverse. 

(2) Let o=+ on No. Then 0 is the identity element and only 0 has an inverse. 

(3) Let o=-onQ. Then 1 is the identity element and every non-zero element has a 
multiplicative inverse. 

(4) Let o=-on Z. Then 1 is the identity element and only +1 have inverses. 

(5) If S is a set and A is the collection of all subsets of S, then only S has an inverse 
for MN, the inverse being S itself, and only @ has an inverse for U, the inverse being 0. 

(6) Here is a new binary operation o on the set S = {a,b,c,d} presented via its 
o-table: 


a0 eSa/0 
Qaoaes!sa 
ead0ndane 
re Aaa 
a re Qa 


Note that a is the identity element, the inverse of a is a, the inverse of b is d, the 
inverse of c is c, the inverse of d is b. 


Definition 2.5.8. A binary operation o on A is associative if for all a,b,c € A, ao(boc) = 
(aob)oc. 


Section 2.5: Binary operations 79 


Examples and non-examples 2.5.9. 


(1) + and - are associative. 


(2) —,/ are not associative. 
(3) Composition of functions is associative. (See Exercise 2.4.6.) 
(4) M,U are associative. 


Theorem 2.5.10. Let o be an associative binary operation on A with identity e. If x has 


an inverse, that inverse is unique. 


Proof. Let y and z be inverses of x. Then 
y = y oe (by property of identity) 
= yo(xoz) (since z is an inverse of x) 
= (yo x) oz (since 0 is associative) 
=eoz (since y is an inverse of x) 


= z (by property of identity). 


Thus by the transitivity of equality, y = z. 


Definition 2.5.11. We say that x is invertible if x has an inverse. The (abstract) inverse 


is usually denoted x~!. 


Be careful! What is the number 57! if o equals +? 


If o is associative, we in the future omit parentheses in aoboc (or inaobocod et 
cetera), as the order of the computation does not matter. 

If o is not associative, we need to keep parentheses! For example, in Z, a—b—c—d can 
have parentheses inserted in many different ways, and four different values can be obtained. 
You may want to find specific four integers a,b,c,d for which you get four distinct values 
with different placements of parentheses. 


Notation 2.5.12. More generally, if o is associative, we may and do omit parentheses 
in expressions such as a1 0 a2 0--:0Od,, aS the meaning is unambiguous. We abbreviate 
1 +@2°*+*+@p (in this order!) to [[;_, ak. When o is addition, we abbreviate aj 0a20-++0dy, 
to )°/_, ax. Examples were already worked out in Section 1.5. 
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Notation 2.5.13. Just like for functions in Remark 2.4.8, also for arbitrary associative 
binary operation o we abbreviate aoa with a”, (aoa)oa with a, et cetera, and in general 


for all positive integers n we write 


a =a" oa Saou, 


This notation is familiar also when o equals multiplication: then 2° stands for 32. When 
o is addition, then the abstract “2°” stands for 10, but of course we prefer to not write 10 
this way; instead we write it in the additive notation 2+ 2+2-+2-+ 2, or briefly, 5-2. The 


o = ¢, (See 


empty product a? makes sense if the set has the identity, and in that case a 
Exercise 1.5.6 for the first occurrence of an empty product.) 
If a has a multiplicative inverse, then a~! is that inverse, and in that case if © is also 


associative, 


n 


a "=a @-) oa7t. 


To prove this by induction on n, we multiply 


ae) oa?) oq? = (a-@-) 5 at) 6 (a ou 2) 


=a") og” l= 


and similarly a” o (a~"—) oa!) =e, which proves that a~” = a~("—)) oa7!, 


Theorem 2.5.14. Let o be an associative binary operation on A. Let f,g have inverses. 


Then go f also has an inverse, and the inverse is f~' og™?. 


Proot. (go frog ag = 90 (fox ypog? Sgoeog* = -é.-and similarly 
f-* og 0 (go f) =e, so that (go f)"* = for og™. 
In particular, if o equals +, then for all x,y € A, 


—(x + y) = (-2) + (-y), 
and if o equals -, then 
0 a eae aa 
Theorem 2.5.15. If x is invertible, then its inverse is also invertible, and the inverse of 
the inverse is x. 


1 


Proof. By definition of inverses of x, 2~!ox = e = xox ~!, which also reads as “the inverse 


of x~! is x.” 
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Theorem 2.5.16. Cancellation. Let o be an associative binary operation on a set A, 
and z an invertible element in A. Then for all x,y € A, 


LoZS>7yo2 Se — Y; 


ZOOL SPO = r=. 


1 


Proof. We prove only the first implication. If oz = yoz, then (roz)oz7! = (yoz)oz71, 


hence by associativity, xo (zo z~') = yo(zoz!). Thus by the definition of inverses and 


identities, r= roe=yoe=y. 


Another proof of the same fact is as follows: 
eS Oe 
ro(zoz +) 
(x0 z) oz! (by associativity) 
(yoz)o 
yo 
yo 
Yy. 


gol 


(zo z~") (by associativity) 


e 


Definition 2.5.17. A binary operation o on A is commutative if for alla,b € A, aob= 
boa. 


Examples and non-examples 2.5.18. 
(1) +,- are commutative on all examples we have seen so far. (If you have seen 
matrices, you know that matrix multiplication is not commutative.) 
(2) M,U are commutative. 


(3) Function composition is not commutative (cf. Example 2.4.9). 


We end this section with an important example. The reader is familiar with manip- 
ulations below when n = 12 or n = 24 for hours of the day (we do not say “28 o’clock”), 
when n = 7 for days of the week, when n = 3 for standard meals of the day, et cetera. 


Important example 2.5.19. Let n be a positive integer. Recall the definition of Z/nZ 
from Example 2.3.9: elements are equivalence classes [0], [1], [2],...,[n — 1]. Define + on 
Z/nZ as follows: [a] + [b] = [a+ b]. Well, first of all, is this even a function? Namely, we 
need to verify that whenever [a] = [a’] and [}] = [b’], then [a + b] = [a’ + 0’], which says 
that any choice of representatives gives the same final answer. Well, a — a’ and b — b’ are 
integer multiples of n, hence (a+ b) — (a’ +b’) = (a—a’)+(b—0’) is a sum of two multiples 
of n and hence also a multiple of n. Thus [a + b] = [a’ + b’], which says that + is indeed a 
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binary operation on Z/nZ. It is straightforward to verify that + on Z/nZ is commutative 
and associative, the identity elements is [0], and every element [a] has an additive inverse 
[-a] = [n — a]. 

Similarly, we can define - on Z/nZ as follows: [a] - [b] = [a- b]. It is left to the 
reader that this is a binary operation that is commutative and associative, and the identity 
elements is [1]. The multiplication tables for n = 2,3,4 are below, where, for ease of 
notation, we abbreviate “{a]” with “a”: 


Z,/2Z: 01 Z/3Z: 0.1°2 “ZAZe =) 001.33 
0); 0 0 0; 0 0 0 Oy) TOO OG 
IE) 0S ak i) Sc eZ dh)! Qs: (Oe ES 
2) OQ 2 1 Z| De Be Me 
3/ 0 3 2 1 
Note that for multiplication in Z/4Z, [3] = [—1] is the multiplicative inverse of itself, and 


(2) has no multiplicative inverse. 


Exercises for Section 2.5 


2.5.1. Let A be the set of all bijective functions f : {1,2} — {1,2}. 


i) How many elements are in A? 


ii) Prove that function composition o is a binary operation on A. 
iii) For all f,g € A compute f og. (How many compositions is this?) 
What is the identity element? 


Verify that every element of A is its own inverse. 


1V 


Vv 


vi) Verify that o is commutative. 


SSF Re Ra RLS Ra ONY 


2.5.2. For any integer n > 3, let A be the set of all bijective functions f : {1,2,3,...,n}—> 
olay 26s ere ce 
i) How many elements are in A? 

ii) Prove that function composition o is a binary operation on A. 

iii) Find f,g € A such that fog#gof. 
2.5.3. Find a set A with a binary operation o such that for some invertible f,g € A, 
(go f) t|#g tof}. (This is sometimes called the socks-and-shoes problem. Say why?) 
2.5.4. Refer to Example 2.5.19: Write addition tables for Z/2Z, Z/3Z, Z/4Z. 


2.5.5. Write multiplication tables for Z/5Z, Z/6Z, Z/7Z. 


2.5.6. Determine all [a] € Z/12Z that have a multiplicative inverse. 


*2.5.7. Determine all [a] € Z/pZ that have a multiplicative inverse if p is a prime number. 
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2.5.8. Here is an opportunity to practice induction. Let f,g : A — A be functions, with 
f invertible. Prove that (f ogo f~1)" = fog"o f~+. (Notation is from Remark 2.4.8.) 


2.5.9. Consider the following binary operation o on {a,b,c}: 


a owes 
QR 2 ala 


[e) 
a 
b 
Cc 
i) Show that o is commutative and has an identity element. 


ii) Show that every element has an inverse and that inverses need not be unique. 
iii) Prove that o is not associative. (Hint: Theorem 2.5.10.) 


*2.5.10. Let S be the set of all logical statements. We define a relation ~ on S as follows: 
P~Q if P and Q have the same truth values in all conditions. For example, “1 = 1” and 
“2 = 2” are related via ~. 
i) Prove that ~ is an equivalence relation on S. Let A be the set of all equivalence 
classes. 

ii) Verify that and, or and xor are naturally binary operations on A. 

iii) Find the identity elements, if they exist, for each of the three binary operations. 

iv) For each binary operation above with identity, which elements of A have inverses? 


2.5.11. Define a binary operation 6 on Rasa @b=a+b+2, where + is the ordinary 


addition on R. Prove that 6 is commutative and associative. Find the identity element of 


, and for each x € R, find its inverse. 

2.5.12. Define a binary operation © on R asa©@b = a+2-a+2-b+2, where + and - are the 
ordinary addition and multiplication on R. Prove that © is commutative and associative. 
Find the identity element of ©, and for each x € R \ {—2}, find its inverse. 


2.6 Fields 


The motivation for the abstract definition of fields below comes from the familiar 
properties of the set of all real numbers. 


Definition 2.6.1. A set F is a field if it has two binary operations on it, typically denoted 
+ and .-, and special elements 0,1 € F, such that the following identities hold for all 
mn, pe F: 

(1) (Additive identity) m+0=m=0+m. 

(2) (Associativity of addition) m+ (n+p) =(m+n) +p. 

(3) (Commutativity of addition) m+n=n+™m. 

(4) (Multiplicative identity) m-1=m=1-m. 

(5) (Distributivity) m-(n+p) =(m-n)+(m-p). 
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(6) (Associativity of multiplication) m-(n-p) = (m-n)-p. 
(7) (Commutativity of multiplication) m-n=n-m. 
(8) (Existence of additive inverses) There existsr € F' such that m+r = r+m = 0. 
(9) (Existence of multiplicative inverses) If m 4 0, there exists r € F such that 
m-r=r-mal. 
(10) 140. 
0 is called the additive identity and 1 is called the multiplicative identity. 


The familiar N*, No, Z, Q, R all have the familiar binary operations + and - on them. 
Among these, N* lacks the additive identity, but all others have the additive identity 0. In 
No, all non-zero elements lack additive inverses, and in Z, all non-zero elements other than 
1 and —1 lack a multiplicative inverse. Thus N*,No and Z are not fields. 

We take it for granted (until Chapter 3) that Q and R are fields. In Section 3.9 we 
construct a new field, the field of complex numbers. There are many other fields out there, 


such as the set of all real-valued rational functions with real coefficients. A few fields are 
developed in the exercises to this section. 


Notation 2.6.2. By Section 2.5, we know that the additive and multiplicative identities 
and inverses are unique in a field. The additive inverse of m is denoted —m, and the 


~! or also 1/m. The sum n+ (—m) of 


multiplicative inverse of a non-zero m is denoted m 
n and —m is also written as n—™m, and the product n-m~! of n and m7! is also written as 
n/m. The latter two operations are also called subtraction and division. The functions 


= end “~* 


are unary (see definition on page 77) with domains F and F \ {0}, respectively. 
By Theorem 2.5.15, —(—m) = m, and for any non-zero m, + = (m7!)~! =m. 

It is standard to omit “.” when no confusion arises. Note that 2-222 +4 is different 
from 2222+ 4, but 2-2+4 is the same as 2x + 4. 

Another bit of notation: - takes precedence over addition, so that “(a-b) +c” can be 


written simply as “a-b+c”, or with the omission of the multiplication symbol, as “ab+c’. 


Theorem 2.6.3. (The other distributive property) If F is a field, then for allm,n,p € F, 
(m+ n)p = mp + np. 


Proof. (m+n)p = p(m+n) (by commutativity of multiplication) 
= pm + pn (by distributivity (5)) 


= mp + np (by commutativity of multiplication). 
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Theorem 2.6.4. If F is a field, then for allm € F,m-0=0=0-™m. 
Proof. We use the trick of adding a clever zero. 
m:0=m-0+0 (since 0 is the additive identity). 
=m-0+(m-0+(—(m-0))) (by the definition of additive inverses) 
=(m-0+m-0)+(—(m-0)) (by associativity of addition) 
= m-(0+0)+(—(m-0)) (by the distributive property) 
= m-0+(-—(m-0)) (since 0 is the additive identity) 
= 0 (by the definition of additive inverses). 


Since multiplication is commutative, it also follows that 0-m = 0. 


We cannot divide by 0. Never divide by 0. For one thing, in an abstract field, 
dividing by 0 is simply not on the list of allowed operations, and for another, n/m always 
stands for that unique element of the field, which, when multiplied by m, yields n. In other 
words, n = (n/m) -m. If m somehow — horribly — happened to be 0, then we would have 
n = (n/0)-0, and by Theorem 2.6.4, this product n would be 0. So, if we were to divide by 0, 
the only number that could possibly be divided by it is 0 itself. But — continuing the horrible 
detour — what should 0/0 be? How about the muddle in the following usage of other axioms 
that seems to require also division of 1 by 0: 0/0 = 1-(0/0) = (1-0)/0 = (0-1)/0 =0-(1/0). 
In any case, “division” by 0 is inconsistent, and not allowed. In a mathematics paper, never 
write “F” or “a/0”. 

At this stage of your mathematical life, you of course never write something like 
“3/0” (my eyes hurt seeing this!), but a common college mistake that is essentially division 
by 0 is canceling x when solving an equation such as x? = 32 to obtain only one solution 
x = 3. This cancellation was division by 0 when x was the other possible solution! Avoid 
even hidden division by 0, so that you can find all the solutions. 


Exercises for Section 2.6 

2.6.1. Verify that the set {0} satisfies axioms (1)—(9) of fields, with 0 being the additive 
and the multiplicative identity. Obviously {0} fails axiom (10). 

2.6.2. Use the set-up in Example 2.3.9. Prove that Z/2Z is a field. Note that in this field 
[2] = [0], so [2] does not have a multiplicative inverse. Also note that in this field, every 
number has a square and cube root. 

2.6.3. Use the set-up in Example 2.3.9. Prove that Z/3Z is a field. Note that in this field 
[3] = [0], so [3] does not have a multiplicative inverse. Note that in this field, [2] is not the 
square of any number. 
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2.6.4. Use the set-up in Example 2.3.9, and let n be a positive integer strictly bigger 
than 1 that is not a prime integer. Prove that Z/nZ is not a field. 


*2.6.5. Use the set-up in Example 2.3.9. Prove that Z/pZ is a field for any prime integer p. 
Note that in Z/7Z, [2] is the square of [3] and of [4]. 


2.6.6. Prove using only the axioms of fields that for any x in a field, (—1)-x is the additive 


inverse of x. 

2.6.7. Let F bea field. Prove that for any x € F’', —(—x) = x. Prove that for any non-zero 
Pe, Vile): =z. 

2.6.8. Let F bea field. Prove that for any x,y € F, (—x)-y=—(a-y) =a-(—y). (Hint: 
Use the definition of additive inverses.) 


2.6.9. Let F be a field. Prove that for any z,y € F, (—x)-(-y) =ax-y. 
2.6.10. Let x be a non-zero element of a field F. Then (—x)~! = —(a~*). 


2.6.11. Let A be a set and F a field. For any functions f,g : A — F we define new 
functions f+ 9,f-g:F + Gas (f+ g)(«) = f(x) + g() and (f + 9)() = f(x) - g(a). 
Here, the second + and - are the binary operations on F’, and the first + and - are getting 
defined. Let S be the set of all functions from A to F’. 
i) Prove that + and - are binary operations on S. 
ii) If A= F,, then S includes polynomial functions. Let T be the set of all polynomial 
functions from F' to F’. Prove that +, - and o are binary operations on T’. 
2.6.12. Let F be a field and n a non-negative integer. We define exponentiation by n 
to be a function f : F — F given as f(x) = x", where x° = 1 for all x and where for 
positive n, 2” = x-x"—!, In this exponentiation, n is called the exponent, or power, and 
x is called the base. 
i) Prove by induction on n that exponentiation is a well-defined function. 
ii) We want to define exponentiation by negative integers as well. What is the largest 
subset D of F such that 2~! is defined for all x € D. Is D = F? Why or why not? 
iii) Prover that for any integers m and n, if « € D, then (@™”)” = (a”)™. 
2.6.13. (Euclidean algorithm over arbitrary fields) Let F' be a field. Let f(x) = agp +ayxa+ 
-+»+ a,x” and g(x) = bo +bix+---+bynx™ for some ao, @1,.--,An, bo, b1,...,6m € F and 
with anbm # 0. 
i) Suppose that m,n > 1. Prove that there exist polynomials q(x) and r(x) such that 
f(x) = q(x)-g(a)+r(ax) and such that the degree of r(x) is strictly smaller than m. 


ii) Prove that there are at most n elements c in F' such that f(c) = 0. 
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2.6.14. (Degree of a polynomial function) Let F' be an infinite field. Let f: F > F bea 


polynomial function given by f(z) = a9 +a1”+---+a,2x” for some non-negative integer n 


and @1,@2,...,@n € F. 
i) Suppose that f is the zero function. Prove that a9 = a, =-:: =a, = 0. (Hint: 
Exercise 2.6.13.) 
ii) Prove that the coefficients ao,a1,...,@,, are uniquely determined. In particular, 


the degree of a polynomial function is uniquely determined. 
2.6.15. (Degree of a polynomial function) 
i) Let f,g : Z/2Z — Z/2Z be defined by f(x) = x”, g(x) = x. Show that f and g are 
an identical polynomial function given by polynomials of different degrees. 
ii) Find a non-zero polynomial p(x) of degree 3 that equals the zero function on Z/2Z. 
2.6.16. (An unusual field.) Let and © be binary operations on R as defined in Exer- 
cises 2.5.11 and 2.5.12. Prove that R is a field with these two binary operations. 


2.7 Order on sets, ordered fields 


If < is a relation on a set S, we define relations <,>,> on S' by 
a <b means that a<bora=b. 
a > b means that b < a. 
a > 6 means that b <a. 
Conversely, if < is a relation on S, then we define < on S' by 
a < bif and only ifa<bandaFb, 


which by before also defines >, >. Similarly, each of >, > determines all four relations of 
this form. Thus one of these relations on a set S implies that we have all four relations 
naturally derived from the one. These relations impose the familiar notion of order. 


We are familiar with these relations <,<,>,> in R. We can also use them in other 


contexts: 


Examples 2.7.1. 

(1) < can be the relation “is a proper subset of” on a set S of all subsets of some uni- 
versal set U. In this case, < means “is a subset of’, > means “properly contains” , 
and > means “contains”. 

(2) If < is the relation “has strictly fewer elements” on the set S of all subsets of the 
set {1,2,3,...,100}, then < means “has fewer elements or is the same set” (rather 
than “has fewer or the same number of elements”). 


88 Chapter 2: Concepts with which we will do mathematics 


Definition 2.7.2. Let < be a relation on a set S. 

An element b € S is called an upper bound (resp. lower bound) of T (in S) if for 
allt €T,t <b (resp. t > b).* 

A subset T of S is bounded above (resp. bounded below) (in S) if there exists 
an upper bound (resp. lower bound) of T in S. 

A set that is bounded above and below is called bounded. 

An element c € S is called a least upper bound, or supremum, of 7, if it is an 
upper bound of T, and if for all upper bounds b of T, c < b. Ifc € T, then c is also called 
the maximum of 7. 

An element c € S is called a greatest lower bound, or infimum, of 7, if it is a 
lower bound of T,, and if for all lower bounds b of T, b< c. If c€ T, then c is also called 
the minimum of T. 

The obvious standard abbreviations are: lub(T’) = sup(T), glb(T) = inf(T), max(T), 
min(T), possibly without parentheses around T. 


Examples 2.7.3. 
(1) The set No has minimum 0. It is not bounded above, for any upper bound u 
would be strictly smaller than the positive integer [|u| + 1 (the ceiling function), 
thus contradicting the assumption of upper bounds. 
(2) The set T = {1/n: n € N*} has maximum 1, it is bounded below, the infimum 
is O, and there is no minimum. 
Proof: In long form, the set equals {1, 1/2, 1/3,1/4,1/5,...}. From this re-writing 
it is clear that 1 is the maximum, that 0 is a lower bound and that 0 is not in the 
set, so 0 cannot be the minimum. Why is 0 the largest lower bound, i.e., why is 
0 the infimum of T? Suppose that r is a positive real number. Set n = [4] +1. 
Then n is a positive integer, and n > 4. By cross multiplying we get that r > - 
which proves that r is not a lower bound on J’. Since r was arbitrary, this proves 
that no positive number is a lower bound on 7’, so that 0 is the greatest lower 
bound on T. 


(3) The set {1/p: pis a positive prime number} has maximum 1/2 and infimum 0. 
(There are infinitely many prime numbers; see the proof on page 33.) 
(4) The sets {(—-1)” : n € N*} and {sin(x) : 2 € R} both have maximum 1 and 


minimum —1. 


(5) The set of all positive rational numbers that are strictly smaller than 7 has infimum 


0 and supremum 7, but it has no minimum and no maximum. 


* A sentence of the form “P is Q (resp. Q’) if R (resp. R’)” is shorthand for two sentences: “P is Q if R” 
and “P is Q’ if R’”. 


(6) 
(7) 
(8) 
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The set {e” : x € R} has no upper bound, it is bounded below with infimum 0 and 
no minimum. 

The empty subset has no minimum nor maximum. Every element of S is vacuously 
an upper and a lower bound of the empty set. 

The set {2 € R: -3 < «—5 < 3} has no minimum and maximum, but the 
infimum is 2 and the supremum is 8. The set {2 € R: —3 < x—5 < 3} has 
minimum 2, supremum 8, and no maximum. The set {2 € R: -3< «—5 <3} 


has maximum 8, infimum 2, and no minimum. The set {1 € R: -3< x—5< 3} 


has minimum 2 and maximum 8. 

If T = {{}, {1}, {2}}, then the inclusion relation on T has minimum {}, and no 
upper bounds in T. If we think of T as a subset of the set S of all subsets of {1,2} 
(or of the set S of all subsets of R), then T has supremum {1, 2}. 

If S is the set of all subsets of the set {1,2,3,...,100} and < is the rela- 
tion “has strictly fewer elements”, then the empty set is the minimum and 


{1,2,3,...,100} is the maximum. If T is the subset of S consisting only 
of sets with at most two elements, then the minimum of T is the empty 
set, and there is no maximum or supremum in 7’. The le elements 
{1,2}, {1, 3},...,{1, 100}, {2,3}, {2,4},..., {99,100} are each greater than or 
equal to all elements of T and they are not strictly smaller than any other element 
GET. 


In the sequel we restrict < to relations that satisfy the trichotomy property: 


Definition 2.7.4. A relation < on a set S satisfies the trichotomy if for all s,t € S, 
exactly one of the following relations holds: 


s=t, s<t, t<s. 


Examples 2.7.5. 


(1) 
(2) 


(3) 


The familiar < on R satisfies the trichotomy. 

If S is the set of all subsets of a universal set U, then the inclusion relation on S 
satisfies the trichotomy. 

If S = {{}, {1}, {2}}, then the inclusion relation on S does not satisfy the tri- 
chotomy. 
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Theorem 2.7.6. Let < on S satisfy the trichotomy. Then a supremum (resp. infimum) 
of a non-empty subset T of S, if it exists, is unique. 


Proof. Suppose that c,c’ are suprema of T in S. Both cand c’ are upper bounds on TJ’, and 
since c is a least upper bound, necessarily c’ < c. Similarly c < c’, so that by trichotomy 
c=c’. This proves that suprema are unique, and a similar proof shows that infima are 


unique. 
Why did we assume that the subset T of S above be non-empty? By definition every 
element of S is an upper bound for @, so in particular if S has no minimum, then @ has no 


least upper bound. 


Definition 2.7.7. A set S with relation < is well-ordered if inf(T) = min(T) for every 
non-empty subset T of S. The element min(T) is called the least element of T’. 


Examples 2.7.8. 

(1) Any finite set with relation < is well-ordered (simply check the finitely many 
pairings for which element is smaller). 

(2) Z is not well-ordered as there is no smallest whole number. 

(3) Similarly, the set of all positive rational numbers is not well-ordered. 

(4) No is well-ordered because for any non-empty subset T of No, by the fact that 
the set is not empty there exists an element n € JT, and after that one has to 
check which of the finitely many numbers 0,1 through n is the smallest one in 7’. 
Similarly, N* is well-ordered. (Chapter 3 covers this more rigorously.) 


Definition 2.7.9. Let F be a set with a binary operation +, with (additive) identity 
0 € F, and with a relation < satisfying the trichotomy. Define Ft = {x € F: 0 < x}, and 
F- ={xe€F:x2< 0}. Elements of F* are called positive, and element of F~ are called 
negative. Elements of F \ F* are called non-positive and element of F \ F'~ are called 
non-negative. 
We define intervals in F' to be sets of the following form, where a,b € F witha < b: 

(a,b)={xeEF:a<2< bd}, 
(a,b. = {xe F:a<a< bd}, 

Se Fg pe bh 
a,bl={xeEF:a<a< bd}, 
(a,coo) ={xeEF:a<z}, 
,~ovHs{verra<z}, 
bade Peg bh 
|= {ee Fea < bt. 
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Definition 2.7.10. We say that a field F' is an ordered field if it has a relation < with 
the following properties: 
(1) < satisfies the trichotomy, i.e., for all x,y € F’, exactly one of the following is true: 


Le Ue 


(2) (Transitivity of <) For allz,y,z¢ F, ifu <y andy < z then a < z. 
(3) (Compatibility of < with addition) For all x,y,z ¢ F, ife <y thenx+2z< 
ytz. 
(4) (Compatibility of < with multiplication by positive elements) For all 
x,y,z€F, ife<yand0 < z then xz < yz. 
A subset of an ordered field is called an ordered set. 


Theorem 2.7.11. Let F' be an ordered set. 
(1) For x € F with the additive inverse —x € F', x € F'* if and only if —x € F~, and 
x € F~ if and only if —x € F*. 
Ohi: 
(3) Forx € F with the multiplicative inverse x~! € F, x € F* if and only ifx—! € Ft, 
and x € F~ if and only if x-! € F-. 


Proof. (1) x € F* if and only if 0 < x, and by compatibility of < with addition this implies 
that —z =0-—2<a2—x2=0,s0 that —x € F-. The rest of (1) is equally easy. 

(2) By assumption 1 4 0. If 1 ¢ F'*, then by trichotomy 1 < 0, and by (1), 0 < -1. 
Thus by compatibility of < with multiplication by positive numbers, since —1 is supposedly 
positive, 0 = 0-(—1) < (—1)-(—1). By Exercise 2.6.9, (—1)-(—1) = 1, which by transitivity 
says that 0 < 1. Since we also assumed that 1 < 0, we get a contradiction to the trichotomy. 
So necessarily 1 € FT. 

(3) Suppose that x € F'*. By trichotomy then exactly one of the following three 


inequalities holds: x~! < 0, x = 0, 2~ > 0. Let O stand for the correct inequality (or 


equality). By compatibility of < with multiplication by the positive number zx, we then 
have l=x-2-!'Oa2-0=0. By (2), the relation 0 must equal >, so that x~! > 0. 

If instead x € F~, then by (1), —a# € Ft, and by what we have proved of (3), 
(-2)-1 € F*. By Exercise 2.6.10 then —z—! = (—x)~1 € Ft, so that 2—1 € F7~ by (1). 


Theorem 2.7.12. Let F' be an ordered field. 
(1) Forz,y€ Ft,x2+y is also in F*. 
(2) For x,y € F*, x-y is also in FT. 
(3) Forz,yeF ,x+yeF. 
(4) Forz,yeF-,x-yeF*. 
(5) Forve€ Ft andye€ F-,ax-yeF. 
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Proof. (1) By assumption, 0 < x and 0 < y. Then by compatibility of < with addition, 
y=O+y<a2+y, and since 0 < y, by transitivity of <,0<a+y,ie,x+yeFt. 

(2) By assumption, 0 < x and 0 < y. Then by compatibility of < with multiplication 
by positive numbers, 0 = 0-y < x-y, which proves that x-y € F*. 


The proofs of the rest are similar. 


Exercises for Section 2.7 
2.7.1. Prove that if < is transitive then < is transitive. 
2.7.2. Prove that < (resp. <) is transitive if and only if > (resp. >) is transitive. 


2.7.3. (An exercise of this flavor is relevant in computing limits as in Section 4.1.) Under 
what conditions is the minimum of {0.2, €/7} equal to 0.2, and when is the minimum €/7? 
Similarly determine min{0.2, €/7, e7/4}. 


2.7.4. For each of the subsets of R below, determine its minimum, maximum, infimum, 


supremum, if applicable. Justify all answers. 
i) {-1,2,7,—7}. 
{(-1)"/n: ne N*}. 
iii) The set of all positive prime numbers. 
iv) {f€R:-l<a< 5}. 
v) {tER:2<2< 5}. 


ii) 
i) 
) 
) 
vi) {x €Q: 2? < 2}. 
i) 
i) 
) 


vii) {ce ER: a27+2-1=0}. 
viii) {v €Q:a27+2-1=0}. 
ix) {n/(n+1):n € No}. 
2.7.5. Suppose that a subset T of an ordered field has a minimum (resp., maximum, 
infimum, supremum) b. Prove that the set —T = {-t: t € T} has a maximum (resp., 
minimum, supremum, infimum) —b. 
2.7.6. Suppose that a subset T of positive elements of an ordered field has a minimum 
(resp., maximum, infimum, supremum) b. What can you say about the maximum (resp., 
minimum, supremum, infimum) of the set {1/t:t « T}? 
{ 2.7.7. (Invoked in Theorem 7.3.4.) Let S and T be subsets of an ordered field F’. Let 
S+T={s+t:seSandteT}. 
i) If S and T are bounded above, prove that sup(S + 7) < sup S + supT. 
ii) If S and T are bounded below, prove that inf(S + 7) > inf $+ infT. 


2.7.8. Let F’ be an ordered field. Prove that 2,3 are positive (and so not zero). 
2.7.9. Let F be a field and x € F. Prove that x? = 0 if and only if x = 0. 


Section 2.8: What are the integers and the rational numbers? 93 


2.7.10. Let F be an ordered field and x € F. Let x,y € F be non-negative (resp. non- 
positive) such that «+ y = 0. Prove that x = y= 0. 

2.7.11. Let F' be an ordered field. Suppose that x < y and p < q. Prove that x+p < y+q. 
If in addition x < y or p < q, prove that rx +p<yt+q. 

2.7.12. Let F be an ordered field and x,y € F. Prove that x < y if and only if 0 < y—~7. 
Prove that x < y if and only if0 < y— 2. 

2.7.13. Let F be an ordered field, and x,y € F* with x < y. Prove that 1/y < 1/2. 
2.7.14. Let F' be an ordered field. Suppose that x < y and that x,y are non-zero. Does 
it follow that 1/y < 1/x? Prove or give a counterexample. 

}2.7.15. (In-betweenness in an ordered field) Let F’ be an ordered field. Let x,y € F' with 
x <y. Prove that x < («+ y)/2 < y. (Why are we allowed to divide by 2?) 

2.7.16. Find an ordered set without a minimum. 

2.7.17. Let F be an ordered set. Prove that any non-empty finite subset S of F has a 
maximum and a minimum. Prove that for all s € S, min(S') < s < max(S). 

2.7.18. Let n > 1 be an integer and F = Z/nZ. (This was defined in Example 2.3.9.) 
Prove that F is not an ordered set. In particular, using Exercise 2.6.5, for any prime 
integer p, Z/pZ is a field that is not ordered. 


2.8 What are the integers and the rational numbers? 


So far we have taken it for granted that elements of No and Z are special and 


until Chapter 3 we take it for granted that Q and R are ordered fields. This section contains 


a formal definition of No as a subset of R with derivations of a few important properties. 
I recommend covering this section very lightly if at all. Construction of No independent 


of R is done formally in the next chapter, and we also derive there all the properties from 


this section. Regardless of whether you read this section or not, you should be able to do 
all the exercises at the end. 
Once we have a definition of the set No of non-negative integers, we define the set Z 


of all integers as No U{n € R : —n ©€ No} and the set Q of all rational numbers as 


{z-yo? 
We accept that R is an ordered field (and we prove this formally in Theorem 3.8.2). 


:a € Z,y €N‘}. In this section we derive no special properties of Z or of Q. 


Definition 2.8.1. A subset T of R is called inductive if 0 € T and if for every element 
n of T, n+1 is also in T. 


Examples of inductive sets are R and Rt U {0}. 
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Theorem 2.8.2. There exists an inductive subset No of R that is a subset of every induc- 


tive subset of R. 


(1) No is the smallest inductive subset of R (in the sense that any inductive subset of 
R contains No). 

(2) If m € No is non-zero, then m = n-+ 1 for some n € No. 

(3) All elements of No \ {0} are positive. 


Proof. The collection § of inductive subsets of R is a non-empty set because it contains R. 
We define No as the intersection of all sets in S. Since 0 is in every inductive set, then 0 


is also in their intersection No. If n € No, then n is in every inductive subset of IR, and so 


by the definition of inductive sets, n + 1 is in every inductive subset of R, and so it is in 
their intersection No. This proves that No is an inductive set. By definition it is a subset 


of every inductive subset of R. This proves (1). 
Suppose that m € No is not O and is not equal to n + 1 for any n € No. Let 
T =No \{m}. Then 0 € 7, and ifn € T, then n+ 1 € T, so that T is an inductive set. 
But No is a subset of every inductive set, so that m € No would have to be in 7, which is a 
contradiction. Thus every non-zero m € No equals n+ 1 for some n € Ng. This proves (2). 
Let T = No \R-. Then 0 € T. If n € T, then by trichotomy, n = 0 or n € R*, 
and hence n+1€R* NT. Hence T is inductive. Since No is contained in every inductive 


set, it follows that No C T, so that No contains no negative numbers. By trichotomy this 


proves (3). 


Theorem 2.8.3. Let n © No. There are no elements of No strictly between n and n+ 1. 


Proof. Let T be the subset of No consisting of all n that satisfy the property that there are 
no elements of N strictly between n and n+ 1. We will prove that T is an inductive set. 

Suppose that there exists m € No strictly between 0 and 0+ 1 = 1. Then by 
Theorem 2.8.2, m = p+ 1 for some p € No. By compatibility of order with addition, 
p=m-—1<1-—1=0, contradicting Theorem 2.8.2 which asserts that elements of No are 
non-negative. This proves that 0 € T. 

Now suppose that n € JT. We want to prove that n+1 € T. Suppose for contradiction 
that there exists m € No that is strictly between n+ 1 and (n+1)+1. Sincem >n+1>1, 
m is not zero, so that by Theorem 3.1.6, m = p+1 for some p € No. Then by compatibility 
of order with addition, p is strictly between n and n+ 1, which contradicts the assumption 
that n € T. So necessarily there is no m with the stated property, so that n+1€ T. 

This proves that T is an inductive subset of No, and since No is contained in every 


inductive subset, the theorem is proved. 
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Theorem 2.8.4. Ifn,m € No and n <m, then m—n € No. 


Proof. Let T= {nE€No: ifmeNo andn<m, then m—n€ No. We will prove that T 
is an inductive set. Certainly 0 © T as m—0= mm. Now suppose that n € T. We claim 
that n+ 1¢€T. Namely, let m € No such that n+ 1< m. Necessarily 1 < m so that by 
Theorem 3.1.6, m = p+1 for some p € No. By compatibility of < with addition then n < p, 
and since n € T, it follows that p—n € No. Hence m—(n+1) = p+1—(n+4+1) =p—neENo. 
This proves that n+1 € JT. Since n was arbitrary, this proves that T is an inductive subset 
of No, and since No is a subset of every inductive set we have that JT’ = No. This proves 


the theorem. 


Theorem 2.8.5. (The well-ordering principle) No is well-ordered (see Definition 2.7.7). 
In other words, every non-empty subset S of No has a least element, that is, S contains 
an element r such that for allt € S, r <t. 


Proof. We will prove that the following set is inductive: 
T={neENo: if SCNp andn€ 5S, then S has a least element}. 


Note that 0 € T’ because 0 < n for all n € No and hence 0 < n for all ne S. 

Suppose that n € T. We next prove that n+1¢€T. Let SCNo andn+1e€S. By 
assumption that n € T’, it follows that the set SU{n} has a least element. Thus there exists 
r € SU {n} such that for allt € S,r<tandr<n. Ifr € S, then we just showed that 
for all t € s, r <t, so that S' has a least element. Now suppose that r ¢ S. So necessarily 
r =n, and this is not an element of S. Then we claim that n + 1 is the least element of 
S. Namely, let t € S. We need to prove that n +1 < t. Suppose for contradiction that 
t<n+1. Since n is the least element of SU {n}, it follows that n < t. Thus we have 
n<t<n+1, so that by Theorem 3.1.7, necessarily n = t. But then n = t € S, which 
contradicts the assumption that n ¢ S. Thus n+ 1 < t, and since t was an arbitrary 
element of S, it follows that n+ 1 is the least element of S. Thus in all cases, ifn+1€S, 
the set S has a least element. Son+1¢€ 7. Thus T is an inductive set, and hence equal 


to No. This means that every non-empty subset of No has a least element. 


Exercises for Section 2.8 

You should be able to do these problems without reading the section. 

2.8.1. Let F be an ordered field and let x € F' satisfy x > 1. Prove that for all positive 
integers n, 2” > land 2"t! >a. 

2.8.2. Let F be an ordered field and let x € F satisfy 0 < x < 1. Prove that for all 
positive integers n,0 <a" <landa™t! <a. 
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2.8.3. (Bernoulli’s inequality) Prove that for all x € Ryo and all n € No, (1+ 2)” > 
1+ ne. 

2.8.4. Does the set {2”/n :n € N*} have a lower (resp. upper bound)? Is it well-ordered? 
Justify. Repeat with {n/2”:n € NT}. 


2.8.5. Prove that for all n € Z there exist no integer strictly between n and n+ 1. 
(Hint: if n is negative, then the interval (n,n +1) can be mirrored across 0 to the interval 
(—n —1,(-n— 1) +1).) 


2.8.6. Prove that for all x € R, the interval (2,x +1) can contain at most one integer. 


2.9 Increasing and decreasing functions 


Definition 2.9.1. Let F,G be ordered sets (as in the previous section), and let A C F. 
A function f : F — G is increasing (resp. decreasing) on A if for allz,y € A,u<y 
implies that f(x) < f(y) (resp. f(x) > f(y)). Iffurthermore f(x) < f(y) (resp. f(x) > f(y) 
for allx < y, then we say that f is strictly increasing (resp. strictly decreasing) on A. 
A function is (strictly) monotone if it is (strictly) increasing or (strictly) decreasing. 


Theorem 2.9.2. Let n be a positive integer and F' an ordered field. Then the function 
f : F — F defined by f(x) = x” when restricted to F* U {0} is strictly increasing with 
the range in Ft U {0}. 


Proof. Let x,y € Ft U {0} with x < y. Then by Exercise 1.7.7, 


k=0 
= 3 Gre — a)" = (y—a)? + bs (i) a ee ame 


Since y — x is positive, by Theorem 2.7.12, (y—«x)” is also positive. Since in addition x > 0 
and (7) is a non-negative integer, then by the compatibilities of >, BS (R)ak(y—ax)r—* > 
0. Thus f(y) — f(x) > 0. 


Section 2.9: Increasing and decreasing functions 97 


Corollary 2.9.3. Let n be a positive integer and F' an ordered field. Suppose that x,y € 
F* U{0} have the property that x” < y”. Then x < y. 


Proof. If x = y, then x” = y”, which contradicts the assumption and trichotomy. If x > y, 
then by Theorem 2.9.2, 7” > y”, which also contradicts the assumption. So by trichotomy 


gr<y. 


Theorem 2.9.4. If F,G are ordered sets and f : F — G is strictly monotone, then f is 
injective, and there exists a strictly monotone function g : Range(f) — F such that for all 
x € F, (go f)(«) = 2 and for all y € G, (f 0 g)(y) = y. In other words, g is the inverse of 
the function f : F + Range(f). 


Furthermore, f is increasing if and only if g is increasing. 


Proof. Let y € Range(f). Then y = f(x) for some x € F’.. If also y = f(z) for some z € F, 
since f is strictly monotone, x = z. So f is injective and x is unique. Thus we define 
g: Range(f) > F by g(y) = xz. Then by definition for all  € F, g(f(x)) = x and for all 
y € Range(f), f(g(y)) = y. If f is increasing and y1, y2 € Range(f) such that y, < yo, then 


g(y1) < g(y2) for otherwise by the increasing property of f, yi = f(g(y1)) = f(g(y2)) = v2, 
which is a contradiction. Thus if f is increasing, so is g = f~'. Thus if g = f ! is 


increasing, so is f = (f~')~!. The same reasoning goes for the decreasing property. 


If the exponentiation function in Corollary 2.9.3 with exponent n takes F*+U{0} onto 
F* U{0}, then by Theorem 2.9.4, we can define its inverse function Ft U {0} > F* U {0}. 
However, the function need not be surjective or have an inverse, witness F = Q and n = 2 
as proved on page 21. 


Remark 2.9.5. Let F' be an ordered set and G an ordered field. Let f,g:F 7G 
be functions. Below we need the definitions of the sum and product of functions (see 
Exercise 2.6.11). 
(1) If f,g are both strictly increasing (resp. both decreasing), then f + g is strictly 
increasing (resp. decreasing). 
(2) If f,g are both strictly increasing (resp. both decreasing) and always take on 
positive values, then fg is strictly increasing (resp. decreasing). 
(3) If f,g are both strictly increasing (resp. both decreasing) and always take on 
negative values, then fg is strictly decreasing (resp. increasing). 


Proof of (3): Let x,y € F with « < y. Suppose that f and g are both increas- 
ing functions, so that f(x) < f(y) < 0 and g(x) < g(y) < 0. Then —f(y), —g(x) 
are positive numbers, so by compatibility of < with multiplication by positive numbers, 
f(x)(-9(2)) < fly)(-g(@)) and (—f(y))9(z) < (—f(y))g(y). By Exercise 2.6.8, this 


98 Chapter 2: Concepts with which we will do mathematics 


says that —(f(x)g(x)) < —(f(y)g(x)) and —(f(y)g(z)) < —(f(y))g9(y)). By transitiv- 
ity of < then —(f(x)g(x)) < —(f(y))g(y)). By compatibility of < with addition, by 
adding f(x)g(x) + f(y)g(y) we get that f(y)g(y) < f(x)g(x). With function notation, 
(fa)(y) < (fg)(x), and since x and y were arbitrary, this says that fg is strictly decreas- 


ing. The proof in the case where both f and g are strictly decreasing is similar. 


Exercises for Section 2.9 


2.9.1. Let n be an odd positive integer and F' an ordered field. Prove that the function 
f :F — F defined by f(x) = x” is strictly increasing. 


2.9.2. Let n be an even positive integer and F' an ordered field. Prove that the function 
f:F- U{0} > F defined by f(x) = x” is strictly decreasing. 

2.9.3. Let n be an odd positive integer and F’ an ordered field. Suppose that a,b € F’ and 
that a” < 6”. Prove that a < b. 


2.9.4. Let F be an ordered field, a € F'* and f : F — F defined by f(x) = ax. Prove 
that f is a strictly increasing function. 


2.9.5. Let F be an ordered field, a € F~ and f : F — F defined by f(x) = ax. Prove 
that f is a strictly decreasing function. 


2.9.6. Prove that the composition of (strictly) increasing functions is (strictly) increasing. 
Prove that the composition of (strictly) decreasing functions is (strictly) increasing. 


2.9.7. Prove that the composition of a (strictly) increasing function with a (strictly) de- 
creasing function, in any order, is (strictly) decreasing. 


2.9.8. Suppose that f : F > G is strictly monotone. 
i) Let B be a subset of F’, and define g : B > G by g(x) = f(x). Prove that g is 
strictly monotone. 
ii) Define h : B > Range(g) by h(x) = f(x). Prove that h is bijective. 


2.9.9. Give an example of a non-decreasing function f : R > R that is not injective. 


2.10 Absolute values 
Definition 2.10.1. Let F' be an ordered field. The absolute value function | |: F > F 
is a function defined as 
0; fo =O; 
xt ifee Ft; 


—x; ifxveF. 
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This defines the absolute value function on the ordered fields Q and R. We think of 
|x| as the distance of x from 0 on the real number line. 

The following theorem lists the familiar properties of absolute values, and the reader 
may wish to prove them without reading the given proof. 


Theorem 2.10.2. Let F be an ordered field. 
(1) For all x € F, |x| > 0. Furthermore, x > 0 if and only if x = |x|; and x < 0 if and 
only if x = —|z|. (In particular, |1| = 1.) 
(2) For all x € F, |x| =|—-2]. 
(3) For alla € F, —|z| <a < |a|. 
(4) For all x,a € F, |x| < a if and only if -a<a <a. 
(5) For all z,a € F, |x| < a if and only if -a<a2 <a. 
(6) For all x,y € F, |xy| = |a|[yl. 


Proof. (1) is from the definition. 


(2) Certainly |0| = | — 0| = 0. Suppose that x € F~. Then |2| = —x and —x € Ft 
so that | — 2| = —x. Thus |z| = —x = |-— a2). If x € F*, then —z € F-, so that by what 
we just proved, | — x| =| — (—2z)| = |a]. 


(3) If x > 0, then |z| = a, and if x < 0, then x < 0 < |a|. Thus by transitivity for 
all x, x < |x|. In particular, when applied to —2, this says that —x < | — x| = |x|, and by 
adding x — |x| to both sides we get that —|z| < x. 

(4) Suppose that |x| < a. Then by (3) and transitivity, « < a, and —x% < |-—2|= 
|x| <a, so that by transitivity and adding x — a to both sides, —a < a. 

(5) The proof of (5) is similar to that of (4). 

(6) We may choose r and s € {1,—1} such that rx > 0 and sy > 0. Then 


Izy] = | + (xy)| (by (2)) 
= |(rx)(sy)| 
= (rx)(sy) (by Theorem 2.7.12) 
= |ra|-|sy| (by the definition of r, s) 
= |||y| (by (2). 


The last part of the theorem above shows that the absolute value works well with 


multiplication: the absolute value of the product is the product of absolute values. It is 
not the case that the absolute value of the sum of two numbers is always the sum of their 
absolute values. Instead we have triangle inequalities as in the theorem below. We use the 
standard notation “+” to mean that the result holds with either + or —. 
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Theorem 2.10.3. The following inequalities hold for an ordered field F’. 
(1) Triangle inequality: For all x,y € F, |u+y| < |x| + |y|. 
(2) Reverse triangle inequality: For all x,y € F, |jx+y| > ||x| — |y|| = |ly| — |a|I. 


Proof. (1) By the first part of the previous theorem, —|z| < x < |z| and —|y| < y < |y|. 
Thus 
—(|2| + |yl) = (-I21) + (-lyl) (by Theorem 2.5.14) 

<a«+(-|y|) (by compatibility of < with addition) 

<a+t+y (by compatibility of < with addition) 

< |x| + y (by compatibility of < with addition) 

< |x| + |y| (by compatibility of < with addition), 
so that by transitivity, —(|x|+ |y|) <«+y < |z|+|y|. Thus by part (4) of the previous 
theorem, |x + y| < |z| + |y|. It follows that | — y| = ja + (—y)| < |z| + |—y|, and by the 
second part again this is equal to || + |y|. 

(2) By (1), la] = |e@+y—+y| < |e +y| + |yl, so that |x| — |y| < ja ty]. Similarly, 

ly| — |x| < |y+<a]|. But by the second part of the previous theorem, |y— z| = |— (y—2)| = 


|x —y| and |y+2| =|r+y|, so that |x+y| > |x| —|y| and |2+y| > |y| — |x| = —(|2| — |y)). 


Since ||z| — |y|| is either |a| — |y| or |y| — |a|, (2) follows. 


Observe that the proof of the reverse triangle inequality above used the triangle 
inequality and did not require referencing Theorem 2.10.2. This made the proof shorter. 


Theorem 2.10.4. Let F be an ordered field. Let r € F. 
(1) Ifr <e for alle € F*, thenr <0. 
(2) Ifr > —e for alle € F*, then r > 0. 
(3) If |r| < ¢ for alle € Ft, thenr =0. 


Proof. Proof of (1): By Theorem 2.7.11, 1 € F'*, so that 0 < 1, and by compatibility of 
< with addition, 1 < 2. Thus by transitivity of <, 0 < 2, so that 2 is positive, and by 
Theorem 2.7.11 (3), 2-1 € Ft. If r > 0, by compatibility of < with multiplication by 
positive numbers, « = r/2 is a positive number. By assumption, r < « = r/2. Again by 
compatibility of < with multiplication, by multiplying through by 2r~!, we get that 2 < 1, 
which contradicts the trichotomy (since we already established that 1 < 2). Thus r ¢ FT, 
so that r < 0. 

The proof of (2) is similar. 

For (3), if |r| < € for all e € F*, then —e < r < . Then by (1) and (2),0<r<0. 
Since Ft 1 F~ =O by trichotomy, it follows that r = 0. 
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Exercises for Section 2.10 

2.10.1. Let F be an ordered field. Prove that the absolute value function on F is increasing 

on F’* U {0} and decreasing on F’~ U {0}. 

2.10.2. Let F be an ordered field. 
i) Prove that for all a,b € F, 


lal + [61] = [al + |b}. 
ii) Prove that for all a,b € FT, 


lal +l] =a +0. 
iii) Prove that for alla,be F-, 


lal + [2 =-a-—b. 


2.10.3. Let F be an ordered field, a € F andr € F'*. Express the sets {x € F': |r—a| < r} 
and {x € F’: |x —a| < r} in interval notation. 


2.10.4. (Triangle inequality) Let F’ be an ordered field and aj,...,a,) € F’. Prove that 
jay +@9+++++4@p| < [ai] + [a2] +--+ lanl. 


2.10.5. (Reverse triangle inequality) Let F' be an ordered field and aj,...,a, € F’. Prove 
that 


Jay + a2 +--+ +4p| > |a1| — |az| — +++ — [an]. 


Give an example of real numbers a1, a2, a3 with |a; + a2 + a3| Z |/ai| — Jag] — |asl]. 


2.10.6. Give an example of a set S in R that is bounded above but {|s| : s € S} is not 
bounded above. 
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In Chapter 1 we laid the basic groundwork for how we do mathematics: how we 
reason logically, how we prove further facts from established truths, and we learned some 
notation. In Chapter 2 we introduced sets, and from those derived functions and binary 
operations that play a big role in mathematics. In this chapter, we use set theory to 
derive numbers and basic arithmetic: on Np (non-negative whole numbers), Z (integers), 


Q (rational numbers), R (real numbers), C (complex numbers), in this order. 


When I teach this course, I go lightly through the first eight sections of this very long 
chapter. Dedekind cuts give straightforward proofs, but if one wishes to avoid Dedekind 
cuts, then these theorems can (and should) be simply accepted as facts. Complex numbers 
play a central role in the rest of the book, so Sections 3.9 through 3.12 should be studied 
thoroughly. Sections 3.13 and 3.14 introduce topology, and they contain more information 
than strictly necessary for the rest of the book. 


3.1 Inductive sets, a construction of natural numbers 


Inductive sets present a new way of thinking about mathematical induction from 
Section 1.6. 

We start with the most basic set: @. We can make the empty set be the unique 
element of a new set, like so: {@}. The sets @ and {0} are distinct because the latter 
contains the empty set and the former contains no elements. With these two distinct sets 
we can form a new set with precisely these two elements: {Q, {0}}. This set certainly differs 
from the empty set, and it also differs from {0} because {0} is not an element of {0} (but 
it is a subset — think about this). So now we have three distinct sets, and we can form 
another set from these: {0, {0}, {0, {0}}}, giving us distinct sets 


0, {0}, {0,103}, {0,10}, (0, {OF}. 


We could call these Zeno, Juan, Drew, Tricia, but more familiarly these sets can be called 
zero, one, two, three, and written 0, 1, 2, 3 for short. At this point, we are simply giving 
them names, with no assumption on any arithmetic properties. We could say “and so on”, 
but that is not very rigorous, is it. 
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Definition 3.1.1. For any set S, its successor S* is defined as St = SU {S}. 


By our naming convention, 0 is not the successor of any set, 0T = 1, 17 = 2, 27 =3. 
Wer-can also wate Ot S00) a 1 = and OCS (0 ry a I) ae eS. 
At this point I rely on your basic training for the naming conventions of 3* = 4, 
Beh Shae 


The definition of successors of successors of successors... is recursive. We can see 
informally that the set of all successive successors of 0 is infinite, in fact we hope that this 
set would be the familiar No, but to get to that we would have to take a union of infinitely 
many sets that may not be elements of some universal set. Taking unions of infinitely many 
sets is subject to pitfalls, just like infinite sums have pitfalls. 


Definition 3.1.2. A set of sets J is called inductive if it satisfies two conditions: 
(1) Oe. 


(2) For any n € J, the successor n* of n is also in J. 


We accept as a given (we take it as an axiom) that there exists a set that contains 0) 
and all its successors: 


Axiom 3.1.3. There exists an inductive set. 


An inductive set has to contain the familiar 0,1,2,3,..., but possibly it can also 
contain the familiar numbers —1.3 or 7, which are not part of the familiar (inductive 
set) No. 


Theorem 3.1.4. Let J be an inductive set. Define No as the intersection of all inductive 
subsets of J. Then No is the smallest inductive subset of J in the sense that if T is any 
inductive subset of J, then No CT. 


Proof. Each inductive set contains 0, hence their intersection contains @ as well, so that 
0 € No. If n is in No, then by definition of No, this n is in all the inductive sets, hence n* 
is also in those same inductive sets, which says that n* is in the intersection. Thus No is 
an inductive set. Furthermore, the intersection of some sets is a subset of all of those, so 


that No is a subset of each inductive subset of J. 
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Definition 3.1.5. (Definition of No) With definition of No as in Theorem 3.1.4, elements 
of No are called natural numbers. (Beware: in some books, this set is written as N, 
whereas in some other books N stands for all non-zero natural numbers. In this book, I 
always write No for clarity.) 


Exercise 3.1.4 shows that No does not depend on the choice of J. 

Some of the elements of No are: 0, {0}, {0, {O}}, {0, {0}, {0, {O}}}, or written alter- 
natively: 0,1,2,3. In fact, we prefer to write elements of No numerically, but for proofs we 
often need sets to resort to the notion of successors of sets. 


Theorem 3.1.6. 0 is not the successor of any element of No, and every element of No 
other than 0 is the successor of some element of No. 


Proof. 0 stands for the empty set, which by definition cannot be the successor of any set. 

Let n € No \ {0}. If n is not the successor of any element in No, then No \ {n} is an 
inductive set which is strictly smaller than No. But this contradicts Theorem 3.1.4. Thus 
there is no such n, which proves that every element of No \ {0} is the successor of some 


element of No. 


As the successor seems to act like “+1”, this says that if 7 were to be in No, then 
m — 1 would have to be in No as well. If we think of inclusions as “less than”s, the next 
theorem removes this possibility if we think of 7 as a number between 3 and 4. 


Theorem 3.1.7. Let m,n € No be such thatn C mC nt. Then eitherm =norm=n. 


Proof. Suppose that m 4 nt. As m is a subset of nt, this means that some x € n* = 
nU {n} is not in m. But n C m, so every element of n is in m, which means that x = n, 


and necessarily m = n. 


Recall mathematical induction from Section 1.6. The notion of inductive sets gives 
us a new perspective at induction and it enables us to eventually rigorously define the 
(usual) arithmetic. We will use the following new form of induction over and over: 


Theorem 3.1.8. (Induction Theorem) Let P be a property applicable to (some) ele- 
ments of No. Suppose that 

(1) P(0) is true; 

(2) For all n € No, (P(n) = > P(n*)) is true. 
Then P(n) is true for all n € No. 


Proof. [WE ARE EXPECTED TO USE THE NEW CONCEPT OF INDUCTIVE SETS, SO WE 
PROBABLY NEED TO DEFINE A SET THAT TURNS OUT TO BE INDUCTIVE.] Let T = {n € 
No : P(n) is true}. [WE WILL PROVE THAT THIS T' IS INDUCTIVE.] By assumption (1) we 
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have that 0 € T. [SO THE FIRST PROPERTY OF INDUCTIVE SETS HOLDS FOR T.] Ifn € T, 
then P(n) is true, and since (P(n) => P(n*)) is true, it follows that P(n*) is true. Thus 
n* € T. This means that T is an inductive set. But T is a subset of No by definition, and 
No is the smallest inductive set by Theorem 3.1.4, which means that No C T, so that P(n) 
is true for all n € No. 

We illustrate below how the inductive theorem can be used. Many more examples 
are in the next three sections. 


Theorem 3.1.9. For alln € No, 0€ n™. 


Proof. Set T = {n € No : 0 € nt}. Since 0* = {0} contains 0 = 0, we have that 0 € T. 
Now assume n € T. This means that 0 € nt C nt U {nt} = (nt)t, so that 0 € (nt)*. 
Thus n* € T. Hence the property holds for all n € No by Theorem 3.1.8. 


In more familiar language, this is saying that 0 is an element of (sets) 1,2,3, et 
cetera. 

I note that the following property is not true for arbitrary sets m and n, but it is 
true for sets that are elements of No. 


Theorem 3.1.10. Let m,n € No. Ifm €n then m Cn. 


Proof. The property in the conditional vacuously holds for n = 0 and all m € No. Suppose 
that n € No has the property that for all m € No, ifm € n then m C n. We want to prove 
that n* has the same property (as n). So let m € Ng, and suppose that m € n*. Since 
nt =nU({n}, either m € n or m = n. By the assumed property on n, this means that 


mCnCnt. Thus by Theorem 3.1.8, the theorem is proved. 


Later, in Theorem 3.1.12, we prove more: that m € n if and only if m Cn. 


Theorem 3.1.11. For alln € No, 
nén, n' Zn, nt €n. 


In particular, n 4 n*. 


Proof. Let T be the subset of No containing all n for which n gn, nt Zn, and nt ¢ n. 

Since the empty set contains no elements, it follows immediately that 0 ¢ 0 and 
0* ¢0. Furthermore, since 0* = 0 U {0} = {@}, this set is not a subset of the empty set. 
Thus 0 € T. 

Now suppose that n € T. 

Suppose (for contradiction) that nt € nt. By the definition of n*, this says that 
nt € nU{n}, so that either nt € n or nt =n. But both of these contradict the assumption 
that n € T. Thus we have proved that nt ¢ n*. 
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Suppose (for contradiction) that (n*)*t Cn*. By the definition of (n*)*, this says 


that nt U{n*} Cnt. In particular, nt € nt U{nt} C n*, which contradicts the previous 


paragraph, and thus proves that (nt)? Zn*. 

Finally, suppose (for contradiction) that (n*)t € nt. Since nt = nU {n}, this 
means that either (n*+)+ € n or (nt)* =n. By Theorem 3.1.10 both of these mean that 
(nt)* Cn, so that in particular the subset nt of (n*)* is a subset of n, which contradicts 


the assumption that n € T. 
Theorem 3.1.12. For all m,n € No, m € n if and only ifm Cn. 


Proof. Suppose that m € n. By Theorem 3.1.10 we know that m C n, and by Theo- 
rem 3.1.11 we know that m #n. Thus ifm en then m Cn. 

Set T = {n © No: for all m € No, ifm Cn then m € n}. Note that m € J andmC 
are both impossible, so that n = § = 0 € T vacuously. Now let n € T, and let m € No 
with m €n*. We want to prove that m € nt. If n € m, then by the first paragraph of 
this proof, n € m. As we also have m C n*, we get a contradiction to Theorem 3.1.7. So 
necessarily n is not an element of m. But then the assumption m € nt = nU {n} means 
that m Cn. If m =n, then m € {n} C n*, and if m Cn, then since n € T, this implies 


that m € n. But n is a subset of n*, so that m € n*. 


Theorem 3.1.13. For m,n € No, m Gn if and only if m* Cn. 


Proof. Suppose that m ¢n. By Theorem 3.1.12, this means that m € n. Thus mt = 
mU{m} Cn, and by Theorem 3.1.11, this is properly contained in nt. This proves that 
i? as 

Suppose that m* Cn*. This means that mU{m} = mt Cnt =nU {n}. Ifn Cm, 
then by Theorem 3.1.7, n = m, which contradicts the assumption mt Cnt. So n is not a 
subset of m. Thus the assumption mU {m} CnU {n} means that mU {m} C n, and thus 
m Cn. But m € n by the same assumption but m ¢ m by Theorem 3.1.11, it follows that 


1S 1s 


Theorem 3.1.14. For all m,n € No, either m Cn orn Cm. 


Proof. Let T= {meENpo: for alln € No, either m Cn orn Cm}. Certainly the empty 
set is a subset of any set, so that 0 € T. 

Suppose that m € T. We want to prove that m* € T. Let n be arbitrary in No. 
We have to prove that either m* C n or n C m*. This is certainly true if n = 0, so it 
suffices to prove this in case n 4 0. Then by Theorem 3.1.6, n = pt for some p € No. 
Since m € T, it follows that either m C p or p Cm, and by Theorem 3.1.13, this says that 


either m* C pt or pt C m*. In other words, either m* Cn or n C mt. Thus mt € T, 
and Theorem 3.1.8 finishes the proof. 
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Exercises for Section 3.1 
3.1.1. Prove that for every set S, 9S €St and S$ CST. 
3.1.2. Let T = {0, {0}, {{O}}}. Prove that T is not the successor of any set S. 
3.1.3. Prove that for all n € No, ifn #0 andn 41, then 1 € n. (Hint: set T= {me 
No: 1 € (m*)*}.) 
3.1.4. Let J and K be inductive sets. Let N be the intersection of all inductive subsets 
of J, and let M be the intersection of all inductive subsets of K. 

i) Prove that T = {n € N: n € M} is an inductive set. 

ii) Prove that T= N=M. 

iii) Prove that the definition of No is independent of the ambient inductive set J. 


3.2 Arithmetic on No 


We apply the Induction Theorem (Theorem 3.1.8) to define addition and multiplica- 
tion on No. For example, for any m € No we define the “adding m” function A,, : No > No 


as follows: 
Am(0) =m; Am(nT) = Am(n)* for n € No. 


The induction theorem says that A,,(n) is defined for all m and n: A,,(0) is in No, and if 
Am(n) € No, then A,,(n)* is in No, hence A,,(n*t) € No, so that A, is a function from No 
to No. Similarly, we define “multiplication by m” as a function M,, : No > No given by 


M,,(0) =0; Mim(n*) = Am(Mm(n)). 
With this functions we define binary operations + and - on No: 
m+n=An(n), m-n=M,,(n). 
Example 3.2.1. We will prove that 2+ 2 = 4, and you can verify that 2.2 = 4. 


24+ 2 = Ad2(2) 
= A5(17) 
= A,(1)* 


108 Chapter 3: Construction of the number systems 


Remark 3.2.2. For all m,n € No, 
m+l=mt, m+(nt)=(m+4+n)t and m-(nt)=m+4+(m-n). 


The last equality is simply rewriting the eer of -, Mm, and A,,, the second equality 
follows by the definitions with m + n?* = A,,(n*) = a m(n))* = (m+n)t, and when 
n=0, this produces m+ 1=m-+0* = (m+0)T = (A,,(0))t = mt. 


Theorem 3.2.3. The following identities hold for all m,n, p € No: 
(1) (Additive identity) m+0=m=0+m. 
(2) m-0=0=0-m. 
(3) (Associativity of addition) m+ (n+p) =(m+n) +p. 
(4) (Commutativity of addition) m+n=n+™m. 


(5) (Multiplicative identity) m-1=m=1-™m. 
(6) (Distributivity) m- (n+p) = (m ar (mp). 
(Distributivity) (n+p)-m=(n-m)+(p-m). 


(7) (Associativity of multiplication) m-(n-p) = (m-n)-p. 
(8) (Commutativity of multiplication) m-n=n-m. 
Proof. (1) For all m € No, by definitions of + and Am, m+0 = A,»(0) =m. It remains 
to prove that 0+m=m. Let T= {meENy):0+m=~m}. Since 0+ 0 = Ao(0) = 0, it 
follows that 0 € T. If m € T, then 
0+(mt) =(0+m)* (by Remark 3.2.2) 
=mt (asmeéT), 

so that m* € T. Theorem 3.1.8 then says that for all m € No, m+0=0=0+m™m. 

(2) By definition of -, m-0 = M,,(0) = 0. Let T = {m € No: 0-m =O}. By 
m:-0=0,0€ET. Ifm eT, then 

0-(m*)=0+4+(0-m) (by Definition of -) 
=0+0 (asmeéT) 
=0 (by (1)), 

so that mt € T. Thus (2) is proved by Theorem 3.1.8. 

(3) Fix m,n € No, and let T = {p € No: m+ (n+p) = (m+n)+4+p}. Since 
m+(n+0)=m+n=(m-+n)-+0, it follows that 0 € T. If p€ T, then 

m+(n+(pt))=m+((n+>p)*) (by Remark 3.2.2) 
=(m+(n+p))" (by Remark 3.2.2) 
( 
( 


as p € T) 
by Remark 3.2.2), 


=((m+n)+p)* 
= (m+n) + (p*) 
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so that pt € T, and so (3) holds by Theorem 3.1.8. 

(4) You prove this. First prove that n + 1= 1+ for all n € No. 

(5) By definition of -, m-1=m-(0T) =m-+m.-0. In (2) we already proved that 
m-0 = 0, and in (1) we proved that m+0 =m. Thusm-1=m-0+m=04+m=m. 
Let T={meENo:1-m=m}. By (2),0ET. IfmeT, then 

1-(m*)=1+(1-m) (by Definition of -) 
=l+m (asmeé€T), 
=m+1 (by (4)), 
=m (by Remark 3.2.2), 


so that mt € T, and Theorem 3.1.8 finishes the proof of (5). 
(6) Fix m,n € No. Let T = {p € No: m- (n+p) = (m-n)+(m-p)}. By (2), 
m-0=0, and by (1), (m-n)+0=™m-n, so that 0€ T. If p€ T, then 
m-(n+(pt)) =m-((n+p)*) (by Remark 3.2.2) 
=m+(m-(n+ p)) (by Definition of -) 


=m+((m-n)+(m-p)) (as p€T) 
=(m-+(m-n))+(m-p) (by (3)) 

= ((m-n)+m)+(m-p) (by (4) 
=(m-n)+(m-+(m-p)) (by (3)) 
=(m-n)+(m-(pt)) (by Remark 3.2.2), 


so that pt € T. Thus the first part of (6) follows by Theorem 3.1.8. 
Fix m,n € No. Let T = {p € No: (m+n)-p=(m-p)+(n-p)}. By (2) and (1), 
(m+n)-0=0+0=(m-0)+(n-0), so that 0€ T. If pe T, then 


(m+n)+ (pt) =Mm+n(pt) (by Definition of -) 


=(m+n)+((m+n)-p) (by Definition of Mn+n) 
=(m+n)+((m-p)+(n-p)) (aspeT) 
= ((m+n)+(m-p))+(n-p) (by (3)) 
= ((n +m) +(m-p))+(n-p) (by (4)) 
= (n+ (m+ (m-p)))+(n-p) (by (3)) 
= (n+ Min(pt)) + (n-p) (by Definition of Min) 
= (Mm(p") +n) + (n+p) (by (4)) 
) 


= Mn(p") + (n+ (n+ p)) (by (3)) 
= My, (pt) + (n+ M,(pt)) (by Definition of M,,) 
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=(m-pt)+(n-p*) (by Definition of -), 

so that pt € T, and so by Theorem 3.1.8, the second part of (6) holds for all m,n,p € No. 

(7) You prove (7). 

(8) Fix m € No. Let T= {n €No:n-m=m-n for all m € No}. By (2), 0€ T. If 
neéT, then 

m-(nt)=m+m-n_ (by Definition of -) 

=m+n-m (since n € T) 
=1-m+n-m (by (5)) 
=(1+n)-m_ (by (6)) 
=(n+1)-m (by (4)) 


=nt*-m, 


so that n* € T, which proves (8) by Theorem 3.1.8. 


This gives us the familiar No with the familiar arithmetic properties. In particular, 
No has a binary operation + with identity 0 and a binary operation - with identity 1. We 
already proved abstractly in Theorem 2.5.5 that such identities are unique. 


Exercises for Section 3.2 

3.2.1. Verify with the new definitions that 3-2 = 6 and that 2-3 = 6. Was one easier? 
3.2.2. Prove (4) and (7) in Theorem 3.2.3. 

3.2.3. Prove that for all m,n € No, mt +n=nt+m. 


3.2.4. Prove that for all m,n,p € No, (m*)-(p+n) =m-(pt+n)+(ptn). 


3.3 Order on No 


Definition 3.3.1. For a,b € No, we write a < b to mean a C b. Relations <,>,> are 


defined from < as in Section 2.7. In particular, a < b means that a C b anda ¥F b. 


Remark 3.3.2. 
(1) Certainly 0 < n for all n € No as 0 represents the empty set. 
(2) < is reflexive. 
(3) < is transitive, as set inclusion is transitive. It follows by Exercise 2.7.2 that 
<,>,> are transitive. 
(4) < is not symmetric: 0 < 1 but 1 £ 0 (because the empty set has no non-empty 
subsets). 
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(5) For all m,n € No, by definition of set equality, whenever m <n and n < m, then 
m= 

(6) By Theorem 3.1.12, m € n if and only if m Cn, which in the order notation says 
that m € n if and only ifm <n. 

(7) For alln € No, certainly n C nt, so that n < nt. Furthermore, by Theorem 3.1.11, 
nen. 

(8) By Theorem 3.1.13, m <n if and only if mt < n*. 

(9) By Theorem 3.1.14, for all m,n € No, either m <n orn <m. 


Theorem 3.3.3. (Trichotomy on No) For all m,n € No, exactly one of the following 
conditions hold: 

(1) m<n, 

(2) iy <a 

(3) i= ne 


Proof. By the last remark above, either m < n or n < m, which means that one of 
the listed three conditions holds. If (1) and (2) hold or if (1) and (3) hold, then by 
transitivity of <,m < m, which contradicts the definition of <. If (2) and (3) hold, then 
by transitivity, n <n, which again gives a contradiction. So no two of the conditions can 


hold simultaneously, which proves the theorem. 


Theorem 3.3.4. (The well-ordering principle) No is well-ordered (see Definition 2.7.7). 
In other words, every non-empty subset S of No has a least element, that is, an element 
r such that for allt € S,r <t. 


Proof. We will prove that the following set is inductive: 
T={neEN: if SCNo andn€ 5S, then S has a least element}. 


Note that 0 € T because 0 < n for all n € No (and hence 0 < n for all n € S). Suppose 
that n € T. We need to prove that nt € T. Let S C No and nt € S. By assumption that 
n € T, it follows that the set S U {n} has a least element. Thus there exists r € SU {n} 
such that for allt € S,r<tandr<n. If r €S, then we just showed that for all t € s, 
r <t, so that S has a least element. Now suppose that r ¢ S. So necessarily r = n, and 
this is not an element of S. Then we claim that n* is the least element of S. Namely, let 
t € S. We need to prove that nt < t. Suppose for contradiction that t < n*. Since n is 
the least element of SU {n}, it follows that n < t. Thus we have n < t < n™, so that by 
Theorem 3.1.7, necessarily n = t. But then n = t € S, which contradicts the assumption 


that n ¢ S. Thus nt < t, and since t was an arbitrary element of S, it follows that nt 
is the least element of S. Thus in all cases, if nt € S, the set S has a least element. So 
n* €T. Thus T is an inductive set, and hence equal to No. 
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This means that every non-empty subset of No has a least element. 


Now we relate order to arithmetic more directly: 


Theorem 3.3.5. Ifm,n € No and m < n, there exists r € No such that m+r=n. 


Proof. Let T= {n € No: for all m € No, if m <n then dr € No such that m+ r= n}. 

If m € No and m < 0, then since 0 stands for the empty set, necessarily m = 0, and 
hence we can set r = 0 to get m+r=n. This proves that 0 € T. 

Suppose that n € T. We want to prove that nt € T. Let m € Ng such that 
m<nt. Ifm=0, set r= nt; then m+r=0+nt =n". If instead m > 0, then by 
Theorem 3.1.6, m = pt for some p € No. Then by assumption pt = m < n*, and so 
by Exercise 3.3.1, p < n. Since n € T, there exists r € No such that p+ r =n. Hence 
m+r=pt+r=(p+r)t = nt. Thus in all cases, whether m = 0 of m > 0, there 
exists r € No such that m+r = nt, so that nt € T. Thus T is an inductive set, and 
since No is the smallest inductive set, it follows that T = No. (Alternatively, we can invoke 
Theorem 3.1.8.) 


Exercises for Section 3.3 
3.3.1. Let m,n € No. Prove that m <n if and only if m* < n7. 


3.3.2. Prove that ifm < nandn < p, thenm < p. Verify also other standard combinations 
of < and <. 


3.3.3. Let m,n € No and m < n. Prove that there exists r € N* such that m+r=n. 


3.3.4. (Compatibility of < and < with addition) Let m,n,p € No. Prove that m >n 
if and only if m+p>n-+p and that m > nif and only ifm+p>n+p. 


3.3.5. (Compatibility of < with multiplication by non-zero numbers) Let 
m,n,p € No. Suppose that m <n. Prove that mp < np for all p. 


3.3.6. Suppose that m,p,n € No and m <n. Prove that m+p<n+p"*. 

3.3.7. Prove that for all n € No, n < 2”. (Recall Notation 2.5.13. Use inductive sets.) 
3.3.8. Prove that for all a,b € N*, a < ab (and b < ab). 

3.3.9. Let a,b,c € No. Is it true that ab = c implies that a < c? 

3.3.10. Let m,n,r € No such that m=n-+r. Prove that m <n. 
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3.4 Cancellation in No 


The familiar subtraction cannot be applied to arbitrary pairs of elements in No: 
recall that elements of No are sets, and we have no way of identifying the familiar negative 
numbers with sets. However, we can cancel addition of equal terms, as we prove below in 
Theorem 3.4.2. We need an intermediate result: 


Theorem 3.4.1. Let m,n €No. Then m =n if and only ifm*t =n. 


Proof. By set inclusions and definition of order, m = n is the same as saying m < n and 
n <m. By Remark 3.3.2, this is the same as mt < nt and n* < m7, and by set inclusions 


again this holds if and only ifm? = nt. 


Theorem 3.4.2. (Cancellation theorem) Let m,n,p € No. 
(1) m+p=n+4 pif and only ifm =n. 
(2) m+n = 0 if and only ifm =0 and n=0. 
(3) m-n=0 if and only ifm =0 orn=0. 
(4) Ifp #0 and p-m=p-n, thenm=n. 


Proof. (1) Certainly ifm =n, then m+ p= Am(p) = An(p) =n+p. Let T= {peENo: 
whenever m+ p= n+p for some m,n € No, then m = n}. By Theorem 3.2.3 (1), 0 € T. 
Suppose that p € T. We will prove that p* is also in T. For this, suppose that for some 
m,n € No, m+ (pt) = n+ (pt). By Remark 3.2.2 this says that (m+ p)t = (n+ p)*. 
Lemma above (Theorem 3.4.2) then implies that m+p= n+p. This proves that pt € T, 
so that Tis an inductive set. This proves the additive cancellation. 

(2) Certainly 0 + 0 = 0. Now suppose that m+n = 0. This says that A,,(n) = 0. 
If n £ 0, then n = q* for some q € No. Thus by the definition of addition, A,,(n) = 
Am(q*) = (Am(q))*, which cannot be the empty set. So necessarily n = 0. But addition 
is commutative, so similarly m = 0. This proves (2). 

(3) In Theorem 3.2.3 (3) we proved that ifm = 0 or n = 0, then m-n = 0. Now 
suppose that m-n = 0. If n £0, then n = p* for some p € No (by Theorem 3.1.6), hence 
0=m-(pt)=m-p+m. Ifm 40, then m is a successor of some element k of No. Thus 
m:-ptm=m-pt+tkt=m-pt+(k4+1)=(m-p+k)+1=(m-p+k)* is the successor 
of some element of No as well, which means that m-p+m #0. So necessarily m = 0. We 
just proved that n 4 0 implies m = 0, or in other words, we just proved that either n = 0 
or m = 0. This proves (3). 

(4) Fix p € No \ {0}. Let 


T ={me€ENo: whenever p:m=p-n for some n € No, then m = n}. 


114 Chapter 3: Construction of the number systems 


If p-0 = p-n, then by (3), 0 =p-n and n = 0, so that 0 € T. Suppose that m € T and 
that for some n € No, p- (mt) = p-n. By (3), n 4 0, so by Theorem 3.1.6, n = rt for 
some r € No. By Definition of -, p-(m*) =p-m+p, and p-n=p-:(r*) =p-r-+p, so 
that p-m+p=p-r-+p. By (1), p-m=p-r, so that as m € T, necessarily m = r. Then 
m* =rt =n, which proves that m* € T. Thus T = No by Theorem 3.1.8. 


Exercises for Section 3.4 

3.4.1. Suppose that m,p € No satisfy m-(p +m) = 0. Prove that m = 0. 

3.4.2. Suppose that m,p,n € No satisfy m+ p=m- (nt). Prove that p= m-n. 

3.4.3. (Compatibility of < with multiplication by non-zero numbers) Let 


m,n,p € No. Suppose that m <n. Prove that mp < np for all p and that mp < np for 
all p £ 0. 


3.5 Construction of Z, arithmetic, and order on Z 


How are we supposed to think of the familiar —5? It does not seem to be possible 
to represent negative numbers with sets as we did for non-negative numbers. We go about 
it by using the rigorous construction of No and all the arithmetic on it. The following is a 
rehashing of Example 2.3.10. 


Definition 3.5.1. Consider the Cartesian product No x No. Elements are pairs of the 
form (a,b), with a,b € No. If.a,b,a’,b’ € No, we write (a,b) ~ (a’,b’) ifa+b' =b4+da’. 


In Example 2.3.10 it was proved that the relation ~ is an equivalence relation on 
No x No. Thus we can talk about the equivalence class [(a,b)] of (a,b). 


Definition 3.5.2. We define Z to be the set of all equivalence classes of elements of No x No 
under the equivalence relation ~. Elements of Z are called whole numbers, or integers. 


We have a natural inclusion of No into Z by identifying n € No with [(n,0)] in Z. 
Note that if n,m € No are distinct, then so are [(n,0)| and |[(m,0)], so that No is indeed 
identified with a natural subset of Z. But Z is strictly larger: [(0, 1)] is not equal to [(m, 0)] 
for all m € No. 
To define arithmetic on Z, we first define arithmetic on No x No. Start with addition: 
for a,b, c,d € No, set 
(a,b) + (c,d) = (a+c,b+d). 
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(There are three “+” here: the last two are the already well-studied addition in No, and 
the first one is the one we are defining now on Ng x No. Yes, we are using the same name 
for two different operations; but that’s ok: many families have a child named Pat, and the 
many Pats are not all equal.) Since + on No is binary, so is + on No x No. We show next 
that + is compatible with ~: 


Theorem 3.5.3. Let a,a’,b,b’,c,c’,d,d' € No such that (a,b) ~ (a’,b’) and (c,d) ~ 
(c,d’). Then (a+c,b+d)~(a’+c,0+d’). 


Proof. By assumption, a+ b’ = a’ +b and c+d' =c' +d. Then by associativity and 
commutativity of + on No, (a+c)+(b'4+d') = (a+v'/)+(c+d’) = (a +b) 4+(o+da)= 
(a+c)+ (0! +d’), so that (a+c,b+d) ~ (a’+c¢,0'+d’). 


This shows that + is well-defined on the equivalence classes of ~, hence + makes 
sense on Z: 

[(a, 6)] + [(¢, d)] = [(a+c¢,b+d)]. 

For example, identifying 5,6 € No with [(5,0)], [(6,0)] € Z, we get 5+ 6 = 11 and 
[(5,0)] + [(6,0)] = [(5 + 6,0)] = [(11,0)], as expected. Also, [(5,0)] + [(0,6)] = [(5,6)] = 
[(0, 1)]. 
Multiplication on Z is a little more complicated: if (a,b) and (c,d) are in No x No, 


then we declare 
[(a, b)] - [(c,d)] = [(a-c+b-d,a-d+b-c)]. 


We next prove that - is well defined on equivalence classes. Suppose that |(a, b)| = [(a’, b’)] 
and [(c, d)] = [(c,d’)]. This means that a+b’ = a’+b and c+d' = c'+d. By the established 
associativity, distributivity, and commutativity properties in Ng, we then have that 
(ac+ bd +a'd' +b’) 4+ (a+0')d' +(a' +b)¢ +d(c+ dd’) + a(c +d) 

=ac+bd+a'd' +b'c' +ad' +0'd' +a'c + bc’ + be + bd’ + ac’ + ad 

=ad+be+a'c +0'd' +a'd' +bd' + ac’ + bc’ + bc! + bd + ac + ad’ 

= (ad+be+a'c' + b'd')+(a’+b)d' + (a+0')¢ + b(c +d) +a(c+d) 

= (ad+bct+a'c+0'd')+ (a+0')d' + (a +b)c + b(c+d')+a(c +4), 
so that by cancellation Theorem 3.4.2 in No, ac+ bd+a’d'+b'¢c =ad+bc+a'c +0b'd’. It 
follows that [(ac + bd, ad + bc)] = [(a’c' + b'd',a'd' + b’c’)], so that multiplication - on Z is 
well-defined. 

Consequently, [(5,0)] - [(6,0)] = [(5-6+0-0,5-0+0- 6)] = [(30,0)], as expected. 

Similarly compute [(5, 0)] - [(0, 6)]. 


We next prove the basic arithmetic laws on Z. 
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Theorem 3.5.4. The following identities hold for all m,n,p € Z: 
(1) (Additive identity) m+ [(0,0)] =m = [(0,0)] +m. 
(3) (Associativity of addition) m+ (n+p) =(m+n) +p. 
(4) (Commutativity of addition) m+n=n+™m. 
(5) (Multiplicative identity) m - [(1,0)] =m = [(1,0)]-m. 
(6) (Distributivity) m-(n+p) =(m-n)+(m-p). 
(7) (Associativity of multiplication) m-(n-p) = (m-n)-p. 
(8) (Commutativity of multiplication) m-n=n-m. 
(9) (Existence of additive inverses) There exists r € Z such that m+r =r+m= 
[(0, 0)}. 
Proof. Let a, b,c, d,e, f € No such that m = [(a, b)], n = [(c,d)| and p = [(e, f)]. With this, 
the proofs follow easily from the definitions of + and -. For example, 
m + [(0,0)] = [(a, 6)] + [(0, 0)] = [(a + 0, 6 + 0)] = [(a, b)] =m, 
and similarly [(0,0)] +m =m. This proves (1). 
Properties (2)-(4) are equally easy to prove. 
To prove (5), we compute 
m - [(1,0)] = [(a, 6) - (4, 9)] 
[(a-1+6-0,a-0+6-1)] 
(a + 0,0 + b)] = [(a,6)] =m, 


and similarly [(1,0)]-m =m. 
We check (6), and you explain each step: 


(m- (n+ p)) = ([(a,5)] - ([(e, @)] + (fe, f)]) 
= |(a,6))-[(c+e,d+ f)] 
= |(a-(c+te)+b-(d+f),a- (d+ f)+5- (ce+e))] 
(a-cta-e+b-d+b-f,a-d+a-f+b-c+b-e)| 


(why may we omit parentheses in addition?) 
=|(a-c+b-d,a-d+b-c)|+[(a-e+b-f,a-f+5-e)| 
= [(a, b)] - [(c, d)] + [(a, )] - [e, F)] 


=m-n+m-p. 


You verify (7) and (8). 
(9) Set r = [(b,a)]. Then m+r = [(a, 6)] + [(b, a)] = [((a+b,b+a)] = [(a+6,a+6)] = 
[(0,0)], and similarly r + m = [(0,0)]. 
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Notation 3.5.5. The additive inverse of the equivalence class [(a, b)| of (a,b) is the equiv- 
alence class [(b,a)| of (b,a). The additive inverse of an element m is always denoted —m. 


Notation 3.5.6. Furthermore, for any a,b € No, by trichotomy on No (Theorem 3.3.3), 
exactly one of the following three conditions is satisfied: 
axb, a=b, b< a. 


In the first two cases, by Theorem 3.3.5, there exists r € No such that a + r = b, and then 
[(a, b)] = [(a,a+r)] = [(0,r)], and in the last case there exists r € No such that b+r=a, 
and then [(a, b)] = [(b+ 7, b)] = [(r,0)]. This proves that every element of Z can be written 
as the equivalence class of [(a,b)] where a or b is 0. 

Recall the identification of elements of No with elements in Z: n € No is identified 
with [(n,0)]. Then with the last two paragraphs, 


[(a, b)| = [(a, 0)] oF [(0, b)] =a+ (—b), 
and it is standard shorthand to write this as a — b. Thus in the future we write elements 
[(a,b)] in Zasa— b. 
Theorem 3.5.7. Let m,n € Z. Then (—m)-n=m- (—n) = —(m-n). 
Proof. Observe that 
m:-(—n)+m-n=m- ((—n) +n) (by distributivity: Theorem 3.5.4 (6)) 
= m-0 (by notation for additive inverses) 


= 0 (by Theorem 3.5.4 (2)). 


Similarly, m-n+m-(—n) = 0. Thus m- (—n) is an additive inverse of m-n, and by 
uniqueness of inverses (Theorem 2.5.10), m-(—n) is the additive inverse of m-n, i.e., 
m:-(—n) =—(m-n). 

With that, 


(—m)-n=n-(—m) (by commutativity of multiplication) 


= —(n-m) (by the previous paragraph). 


Theorem 3.5.8. Let m,n € Z. Then m-n= 0 if and only ifm = 0 orn = 0. 


Proof. We already proved that if m= 0 or n = 0, then m-n = 0. 

Now suppose that m:n = 0. Write m = |(a,b)| with a,b € No. By Notation 3.5.6 
we may assume that either a = 0 or b= 0. Write n = |[(c,d)] for c,d € No. 

If a = 0, then [(0,0)] = 0 = m-n = [(0,b)] - [(c,d)] = [(b- d,b- c)]. This means (by 
definition of ~) that b-d =b-c. If b = 0, then m = [(0,0)] = 0, and if b 4 0, then by 
Theorem 3.4.2, d = c, so that n = [(c, d)| = [(0,0)] = 0. 
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Similarly, if b = 0, then either m = 0 or n = 0. But a = 0 or b = 0 covers all the 


possible cases for elements of Z, so the theorem is proved. 


Definition 3.5.9. (Order on Z) Define a relation < on Z as follows: |(a,b)| < |(c,d)] if 
a+d<b+c, where the latter < is the already known relation on No (see Section 3.3). 
Similarly define <,>,>. 


It is left to the reader to verify the following: 


Theorem 3.5.10. 
(1) < on Z is well-defined (this means that if one picks different representatives of the 
equivalence classes, < for one pair holds if and only if it holds for the other pair). 
(2) <,> are reflexive and transitive but not symmetric. 
(3) <,> are transitive but not reflexive and not symmetric. 
(4) Ifm € Z and0 < m, thenm € No. 
(5) Ifm € Z and m < 0, then —m € No. 


Theorem 3.5.11. The following properties hold on Z: 
(1) (Trichotomy) For any m,n € Z, exactly one of the following three conditions 
holds: 
Gia. 
(ii) m <n. 
(iii) n <_m. 
(2) (Transitivity) For all m,n,p € Z,m <n andn< p imply m < p. 
(3) (Compatibility of < with addition) For allm,n,p € Z,m < n implies m+p < 
n+p. 
(4) (Compatibility of < with multiplication by positive numbers) For all 
m,n, p€ Z,m<n and 0< p imply mp < np. 


Proof. Write m = [(a, b)|, n = [(c,d)], and p = [(e, f)] for some a, b,c, d,e, f € No. 
By Theorem 3.3.3, exactly one of the following three conditions holds: 
at+d<b+c, a+d=b+c, b+c<acd. 


By definition, and since <,=, > are compatible with the choice of representatives by The- 
orem 3.5.10, this says that exactly one of the following three conditions on Z: 


(a,b) < [lad], [ao] =[le 4a], [(e,d)] < [(a, 6]. 


This proves (1). 
Suppose that m <n and n < p. This means that a+d<b+candc+f<d+e. 
Then by associativity and commutativity of addition in No, and by transitivity of < on No, 
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(a+ f)+(c+d) =(a+d)+(c+f) < (a+d)+(d+e) < (b+c)+(d+e) = (b+e)+(c+ a), 
so that by Exercise 3.3.4, a+ f < b+e. This says that m < p, which proves that < is 
transitive. This proves (2). 


The other properties follow similarly and are left to the reader. 


With this, we can use the adjectives positive, negative, non-positive, non- 
negative from Definition 2.7.10 for elements of Z, and in general we can use the properties 
developed in Sections 2.7 and 2.10, such as absolute values, triangle inequality, and 
reverse triangle inequality. We generalize Theorem 3.3.4 as follows: 


Theorem 3.5.12. Let S be a non-empty subset of Z that is bounded above (resp. below). 
Then S contains its largest (resp. least) element. (For definitions see Definition 2.7.2.) 


Proof. Let s € S, and let b € Z be an upper bound on S. Let T= {n €E No: b-—ne S}. 
Then T is a subset of No. Note that b— s € No and that s = b— (b—s) € S, so that T is 
not empty. Thus by Theorem 3.3.4, JT has a least element, call it t. Thus b—t € S. We will 
prove that b—t is an upper bound for S. Let r € S. We need to prove that b—t > r. Since 
b is an upper bound on S$ and r € S, we have that b—r € No. Since r= b-(b-—r) ES, 
we have that b—r € 7. But then by the choice of t, t < b—r, which proves that r < b—t, 
as desired. This proves the first part. 

Now let s € S, and let b € Z be a lower bound on S. Let T= {ne€Z:b+ne S}. 
Since b is a lower bound, we conclude that T' is a subset of No. Note that s — b € No and 
that s = b+(s — 6b) € S, so that T is not empty. Thus by Theorem 3.3.4, T has a least 
element, call it t. Thus b+ +t € S. We will prove that b+ ¢ is a lower bound for S. Let 
r € S. We need to prove that b+t <r. Since b is a lower bound on S and r € S, we have 
that r—b € No. Since r = b+ (r —b) € S, we have that r —b € T. But then by the choice 
of t, t <r —b, which proves that r > b+ t, as desired. This proves the second part, and 


hence the theorem. 


Exercises for Section 3.5 
3.5.1. Prove that for all a,b,c € No, (a,b) ~ (at+c,b4+c). 


3.5.2. Suppose that a,b,c,d € No such that (a,b) ~ (c,d) and a > c. Prove that there 
exists e € No such that a=c+eandb=d+e. 


3.5.3. Convince yourself that Z as defined in Definition 3.5.2 is the same as the familiar 
set of all integers. 
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3.5.4. (Cancellation in Z) Let m,n, p € Z. 
i) Ifm+p=n+p, prove that m =n. (Hint: Theorem 2.5.16.) 
ii) Ifp # 0 and m-p = n-p, prove that m = n. (Note that Theorem 2.5.16 is 
inapplicable.) 
3.5.5. Prove that if (a,b) ~ (c,d) then (a,c) ~ (6, d). 
3.5.6. Let a,b,c,d € No. Suppose that [(b,a)| < [(c,d)]. Prove that [(a,b)] > [(d,c)] and 
[(a, b)] + [(e, d)] > [(0, 0)]. 
3.5.7. Prove (2), (3), (4), (7), (8) in Theorem 3.5.4. 
3.5.8. Prove Theorem 3.5.10. 
3.5.9. Prove (3) and (4) in Theorem 3.5.11. 


3.6 Construction of the ordered field Q 


We next define the set Q, arithmetic, and order on it, and we show that Q is a field. 
Elements of course correspond to the familiar rational numbers. 


The following is a rehashing of the construction of Q from Z first given in Exer- 
cise 2.3.10. The relation 2 on Z x (Z \ {0}), given by 


(m,n)i(m',n') if m-n'=m'-n, 


is an equivalence relation. The reader may wish to go through the details of the proof, to 
check that only those properties of Z are used that we have proved in the previous section, 
and that no “well-known” knowledge creeps into the proof. 


Remark 3.6.1. For any a € Z and b,c € Z \ {0}, the equivalence class [(ca, cb)] equals 
the equivalence class [(a, b)]. In particular, by possibly taking a negative c, we may always 
choose a representative of any equivalence class in Z x Nt. 


Definition 3.6.2. We define the set of all equivalence classes of this relation to be Q. The 
elements of Q are called rational numbers. 


We have a natural inclusion of Z into Q (and thus of No into Q) by identifying m € Z 
with [(m,1)]. Note that if m,n € Z are distinct, so are [(m,1)] and [(n,1)], so that Z is 
indeed identified with a natural subset of Q. But Q is strictly larger: [(1,2)] is not equal 
to [(m,1)] for any m € Z. More about the familiar rational numbers is in Notation 3.6.5. 


We declare the following binary operations + and - on elements of Z x (Z \ {0}): 


(m,n) + (p,q) =(m-qt+p-n,n-q), (m,n)- (p,q) =(m-p,n-q). 
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(The + and - inside the pair entries are the operations on Z; the other + and - are what 
we are defining here.) We show next that the new + and - are compatible with 2: 


Theorem 3.6.3. Let m,m’,n,n’',p,p',q,q/ € No such that (m,n)? (m',n’) and (p,q)? 
(p’,q’). Then 


((m,n) + (p,q) U((m',n’) + (p',q’)), 
and ((m, n) - (p,q) ¢((m', n’) - (p',q’)) - 


Proof. By assumption, m:n’ = m’-n and p-q' = p’-q. Then (you explain each step): 


(m-qtp-n)-(n'-q')=(n'-q')-(m-q+p-n) 
= (n'-q')-(m-q) +(n'-q')- (pn) 
=(q'-n')-(m-q)+n'- (q': (p-n)) 
=q'-(n'-(m-q))+n'- ((q'-p) +n) 
=q'-((n’-m)-q)+n'- ((p-d’)-n) 
=q'-((m-n')-q)+n'- ((p'-q)-n) 
=q':((m'-n)-q)+n'- (p'- (q-n)) 
=q'-(m'-(n-q))+n'-(p':(n-q)) 
=(q'-m’)-(n-q) + (n'-p’)- (n-q) 
= (n-q)-(q'-m') + (n-q)-(n'-p’) 
=(n-q)-(m'-q') +(n-q): (p> n’) 
= (n-q)-(m'-q' +p'-n') 
=(m'-q'+p'-n’)-(n-q). 


This says that 
(m-q+p-n,n-q),=(m'-qt+p'-n',n'-¢’), 
which in turn says that ((m,n) + (p,q)) ¢((m’,n’) + (p’,q’)). 
The verification of the easier ((m, 7) - (p,q))2((m’, n’) - (p’, q’)) is left to the reader. 


The last theorem proves that the following binary operations + and - on Q are 
well-defined: 


[(m,n)] - [(p,@)] = [(m, 2) - (p, g)] = [(m- p, n+ @)]- 


The + and - on the left are what we are defining here; the + and - in the middle are binary 
operations on Z x (Z \ {0}), and the + and - on the right are binary operations on Z. 
We now move to arithmetic laws on Q. 
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Theorem 3.6.4. Q is a field (see Definition 2.6.1), or explicitly, the following identities 
hold for all m,n, p € Q: 
(1) (Additive identity) m+ [(0,1)] =m = [(0,1)] +m 
(2) m -[(0,1)] = [(0, 1)] = [(0, 1)] -m 
(3) (Associativity of addition) m+ (n+p) =(m+n)+p. 
(4) (Commutativity of addition) m+n=n+™m. 
(5) (Multiplicative identity) m- [(1,1)] =m =[(1,1)]-m 
(6) (Distributivity) m- (n+p) =(m-n)+(m-p). 
(7) (Associativity of multiplication) m-(n-p) = (m-n)-p. 
(8) (Commutativity of multiplication) m-n=n-m. 
(9) ((1,1)] 4 ((0,D). 
(10) (Existence of additive inverses) There exists r € Q such thatm+r=r+m= 
(0,1)} 
(11) (Existence of multiplicative inverses) If m 4 |(0,1)], there exists r € Q such 
thatine? =r om = (471)). 


Proof. Let a,b,c, d,e, f € Z such that m = [(a, b)], n = [(c,d)] and p = [(e, f)]. With this, 
the proofs follow easily from the definitions of + and -. For example, 


m + [(0, 1)] = [(a, 6)] + [(0, 1)] = [(a + 0,6- 1)] = [(a, 6)] = m, 
and similarly [(0,1)] +m =m. This proves (1). 
Properties (2)-(5) are equally easy to prove. 
We check (6), and you explain each step: 
“(Ie d)] + Ile, A) 
]-[fe- f+e-dd-f)| 
-f+e-d),b-(d-f 


m:(n+p) 
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“— 

S 
z= 
Se 


| 
~~ 

Q 
~~ 


You verify (7) and (8). 

(9) By the properties of Z,0-1=040* =1=1-1, so that (0,1) % (1,1), so that 
(0,1) 4 (1,1) in@ 

(10) By definition of m and Q, [(—a, b)] € Q, and m+[(—a, b)] = [(a-b+(—a)-b, b-b)] = 
[((b- a+b: (—a),b- 6)| = [(6- (@+ (—a)), b+ b)] = [(b- 0, - &)] = [(0,6- 6)] = [(0,1)], and 
similarly |(—a, b)| +m = [(0,1)]. Thus additive inverses exist in Q. 

(11) By assumption [(a,b)] 4 [(0,1)], This means that a- 1 4 0-6, so that a F 0. 
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Then [(b,a)] € Q, and [(a,b)] - [(b, a)] = [(a- 6, b- a)] = [(a- b,a- b)| = [(1,1)]. Similarly 
[(, @)] - [(a, 6)] = [GD]. 


Notation 3.6.5. The additive inverse of the equivalence class [(a, b)| of (a,b) is |(—a, b)], 
and the multiplicative inverse of non-zero |(a,b)]| is [(b,a)]. For an element m, its additive 
inverse is always denoted —m, and the multiplicative inverse, when it exists, is written 
m-—'. Thus for any n € Z, by identifying n with [(n,1)] in Q, the additive inverse is —n = 
—[(n,1)] = [(—n, 1)], and if n 4 0, the multiplicative inverse is n~! = [(n, 1)]~! = [(1,n)]. 
Thus, under our identifications, any element [(m,n)] in Q is equal to 


[(m,n)] = [(m, )] -[A,n)] = m-n™, 


and it is standard shorthand to write this as m/n. Thus in the future we write elements 
[(m,n)] in Q as m/n, and whenever m,n € Z and n ¥ 0, then [(m,n)] = m/n € Q. By 
Remark 3.6.1 we may even assume that n > 0. 


Theorem 3.6.6. (Cancellation laws in Q) Let m,n,p € Q. 
(1) Ifm+p=n+p, thenm=n. 
(2) Ifm:-p=n-p and ifp #0, then m=n. 


Proof. Since every p € Q has an additive inverse and since every non-zero p € Q has a 


multiplicative inverse, both parts are simply special cases of Theorem 2.5.16. 


Theorem 3.6.7. Q is an ordered field (see Definition 2.7.10) with the order < given by: 


ail Gd < bes <it-bd > 0° 
[(a, b)] < [(¢, d)] if a >be; ifbd <0. 


Proof. Theorem 3.6.4 proves that Q is a field. Now we prove that Q is ordered. 

First we need to prove that < is well-defined. Let m,n € Q. Write m = [(a,b)] = 
[(a’, b’)], n = [(c, d)] = [(c’, d’)|. By definition of Q, a’b = ab’ and c'd = cd’. By associativity 
and commutativity of multiplication in Z, (bd)(b'd’)(b'c! — a’d’') = bb'dd'(b'c — a’'d') = 
b(b’)2d' (dc’) — b'd(d’)?(ba’) = b(b')?d'(ed’) — b'd(d’)?(ab') = (b')?(d')?(be — ad). Thus the 
sign of (bd)(b’d’)(b'c’ — a’d’) is the same as the sign of bc — ad. In particular, if bd > 0, 
b’d' > 0, and ad < bc, then necessarily a'd’ < b'c’; if bd > 0, b’d’ < 0, and ad < bc, then 
necessarily a’d' > b'c’; if bd < 0, b’d’ > 0, and ad < bc, then necessarily a’d' > b'c’; if 
bd < 0, b’d’ < 0, and ad < bc, then necessarily a'd’ < b’c’. All these cases confirm that the 
order is independent of the representative of the equivalence class. 

Trichotomy follows from trichotomy on Z (Theorem 3.5.11). The proof of transitivity 
of <, and of compatibility of < with addition and with multiplication by positive rational 


numbers is left as an exercise. 
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Thus the following terms and facts, developed in Sections 2.7 and 2.10, apply to 
elements of Q: positive, negative, non-positive, non-negative, intervals, absolute 


value, triangle inequality, reverse triangle inequality. 


Theorem 3.6.8. (Archimedean property) Let m,n € Q. Ifm > 0, there exists p € NT 
such that n < pm. 


Proof. If n < 0, then take p= 1, and n <0 <m=pm. So we may assume that n > 0. 
Write m = a/b and n = c/d for some positive a,b,c,d € Z. Set p= bc +1. Then pisa 
positive integer, as is ad. By compatibility of < with addition on Z and multiplication by 
positive integers, we have that pad = bcad + ad > bcad > bc, and since b,d > 0, it follows 
that pm = p(a/b) > c/d=n. 


Exercises for Section 3.6 

3.6.1. Compute (5/6) - (—3/4), (5/6) + (—3/4). 

3.6.2. Prove that for all (m,n) € Zx (Z\ {0}) and all non-zero p € Z, (m,n) (m-p,n-p). 
In other words, m/n = (m-p)/(n-p). 

3.6.3. Prove (2), (3), (4), (5), (7), (8) in Theorem 3.6.4. 

3.6.4. Finish the proof of Theorem 3.6.7. 

3.6.5. Let m,n,p,q € Z with n,q non-zero. Prove that m/n-+ p/q and m/n- p/q are 
rational numbers. 


3.6.6. Let m € Q. Prove that there exists a positive integer N such that m < 2%. (Hint: 
If m < 0, this is easy. Now suppose that m > 0. Write m = a/b for some positive integers 
a,b. Prove that a/b < a < 2%; perhaps apply or reprove Exercise 3.3.7.) 

3.6.7. Find a non-empty subset S of Q that is bounded above such that S does not contain 
its least upper bound. (Contrast with Theorem 3.5.12.) 


3.7 Construction of the field R of real numbers 


In this section we construct real numbers from Q via Dedekind cuts: a Dedekind cut 
stands for a real number and vice versa. We establish the familiar arithmetic on R, and 


in the subsequent section we show that R is an ordered field with further good properties. 


To go through these two sections carefully would take many hours. I typically spend 1.5 
class hours going through them, asking the students not to take notes but instead to nod 
vigorously in agreement and awe at all the constructions and claims. I expect the students 
to get the gist of the construction, a conviction that any arithmetic and order property 
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of real numbers can be verified if necessary, but I do not expect the students to have 
gone through all the details. Furthermore, I do not use the standard Dedekind cuts, as 
arithmetic with them is messier (see Exercise 3.7.12). 


Definition 3.7.1. A Dedekind cut is a pair (L, R) with the following properties: 
(1) L,R are non-empty subsets of Q. 
(2) For alll € L andallre R,l<r. 
(3) For alll € L, there exists lI! € L such that 1 <I’. 
(4) For all r © R, there exists r' € R such that r' <r. 
(5) LUR is either Q or Q \ {m} for some m € Q. In case LU R = Q\ {m}, then for 
allle€ EL andallre R,l<m<r. 


By condition (2) and the trichotomy property of < on Q, LN R= 9. 

We can visualize a Dedekind cut (LZ, R) as the separation of the rational number 
line into the left and right parts, and the separation comes either at a rational number m, 
or possibly at something else. The letters “DL” and “R” are mnemonics for “left” and 
“right” . 

An easy consequence of the definition is that if 1 © DZ and x € Q with x < l, then 
x € L, and similarly that ifr € R and x € Q with x > 1, then x € R. (See Exercise 3.7.1.) 


Examples and non-examples 3.7.2. 
(1) (Q\ {0}, {0}) is not a Dedekind cut: it violates conditions (2) and (4). 
(2) {x €Q: a2? < 2h,fx € Q: x? > 2}) is not a Dedekind cut: it violates condi- 
tion (2). 
(3) ({{a €Q:2<0},{@ €Q: az > 0}) is a Dedekind cut. 
(4) ({x €Q:a <0}, {x €Q: x > O}) is not a Dedekind cut: it violates condition (3). 


Theorem 3.7.3. Let (L,R) be a Dedekind cut. Let x € Q. 
(1) Suppose that x <1 for somel € L. Then z € L. 
(2) Suppose that x > r for somer € R. Then z € R. 


Proof. We only prove the first part. By condition (2), x is not in R, and by condition (5), 
wee: 


Definition 3.7.4. We define R to be the set of all Dedekind cuts. Elements of R are 
therefore Dedekind cuts, but more normally, the elements are called real numbers. 


The following theorem identifies a large collection of Dedekind cuts. 
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Theorem 3.7.5. For anym € Q, Dy = ({1E Q: 1 < m},{r © Q:r > m}) is a Dedekind 
cut. 
The union of the left and the right parts of D,, equals Q \ {m}. 


Proof. Let L = {1€ Q:l<m}, R={re€Q:r> mb}. By Theorem 2.7.11, 1 > 0, so 
that by compatibility of < and + on Q, m+1>mandm—1<m. Thus m—1€L and 
m+1eéR. Thus L and R are not empty. 

By trichotomy on Q, for every r € Q, exactly one of the following holds: r < m, 
r=m,r>m. SoLUR= Q\{m}. By definition, for any! € Landanyre R,l<m<r, 
and by transitivity, we conclude that | < r. 

If 1 € L, then 1 < m, and by Exercise 2.7.15, l’ = (1+ m)/2 is a rational number 
strictly larger than / and strictly smaller than m. Thus, | < I’ € L, and condition (3) of 
Dedekind cuts holds. Similarly, for any r € R, (r+m)/2 is in R and strictly smaller than r, 
so that condition (4) of Dedekind cuts is satisfied. 


Theorem 3.7.6. The function Q — R defined as taking m € Q to Dy» is injective. 
Thus we think of Q as a subset of R. 


Proof. If m,n are distinct in Q, say m < n, then by Exercise 2.7.15, (m+ n)/2 is in the 
right side of D,, and in the left side of D,, so that D,, and D, are also distinct. This 
proves that the function is injective. 


Many differences between Q and R are in Section 3.8. Here is the first one: 


Theorem 3.7.7. R contains Dedekind cuts other than those in Q. 


Proof. We prove that ({2 € Q: x7 < 2oraz < O},{x € Q: 27 >2Qandz > O}) isa 
Dedekind cut that is not of the form D,, for any m € Q. 

Set L= {xe Q: 2% <2or¢e <Oband R={r@eQ: 2? > 2andz > 0}. Then 
O0¢ Land 2€ R, so that L and FR are not empty. 

Let [,,l2 € Land r € R with 1, < 0 and EG <2. 

By transitivity of < since r > 0 we have that 1; < r. By definition, 13 < 2 < r?, so 
that by Theorem 2.9.2, lg < r. In particular, 0 < r. 

By Exercise 2.7.15, 1, < 1,/2 <0, and 1,/2 € L. By assumption, 2 —/2 > 0. Thus by 
Theorem 3.6.8 there exists p € N* such that 2ly < (2—13)p. Hence 2l2/p—1/p? < 2l2/p < 
2-12, so that (lg + 1/p)? = 2 + 2le/p — 1/p? < 2. Hence lg + 1/p € L and lg + 1/p > lo. 

By assumption, r? — 2 > 0, so that by Theorem 3.6.8 there exist p;,p2 € Nt such 
that 2r < pi(r? — 2) and por > 1. Set p = max{pi, po}. Then 2r < p(r? — 2) and pr > 1. 
It follows that 2r/p—1/p? < 2r/p < r? —2. Hence 2 < r? — 2r/p+1/p? = (r —1/p)?. By 
the condition pr > 1, it follows that r—1/pe Randr—1/p<r. 
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Let x € Q. If x < 0 then x € L. Now suppose that x > 0. By trichotomy, one of 
the following holds: x? = 2, 2? < 0, 7 > 0. By page 21, 2 is not the square of a rational 
number, so the first option is not possible, which proves that x € LU R. 

This proves that (ZL, R) is a Dedekind cut. Since LU R = Q, it follows that (L, R) 
cannot equal D,, for any m € Q. 


Theorem 3.7.8. Let L be a non-empty proper subset of Q such that for alll € L and all 
x €Q, ifa <l then € L. Then there exists a unique subset R of Q such that (L, R) is a 
Dedekind cut. 

Let R be a non-empty proper subset of Q such that for all r € R and all x € Q, if 
x >rthenxz € R. Then there exists a unique subset L of Q such that (L, R) is a Dedekind 
cut. 


Proof. We only prove the first part. Set R’ = Q\ L. 

If condition (4) of Dedekind cuts holds for (L, R’), let R = R’. Otherwise, there 
exists m € R’ such that for all r’ € R’, r’ > m. In this case set R = R’ \ {m}. Now let 
r © R. Then r > m. By Exercise 2.7.15, (r + m)/2 is a rational number strictly larger 
than m and strictly smaller than r, so that by assumption (r + m)/2 € R. Thus condition 
(4) of Dedekind cuts holds for (L, R). 

Since L is a proper non-empty subset of Q, so is R’, and even the modified R in 
the previous paragraph. If / © LZ and r € R, by construction | # r and even | < r. Thus 
conditions (1) and (2) hold, and condition (3) holds by assumption. Furthermore, condition 


(5) holds by construction as well. 


Now we turn to the arithmetic on R. 


Definition 3.7.9. Let A and B be subsets of Q. Define 
A+B={a+b:ae€Aand be B}, 
—A={-a:ae A}, 
A-B={a—b:a€Aandbe B}=A+(-B), 
A-B={a-b:a€A and be B}. 


Theorem 3.7.10. Let (Li, Ri) and (L2, R2) be Dedekind cuts. Then (Li + L2, Ri + R2) 
is a Dedekind cut. 


Proof. Since Ly, Lz, R,, Re are non-empty subsets of Q, then Ly + Lz, R, + Re are non- 
empty subsets of Q as well. Let 1 © DL, + Log and r € Ry + Ro. Then /] = 1, + ly and 
r=7r,+ 12 for some 1; € L; and r; € R; (¢ = 1,2). Then 1, < ry and Ig < rg, so that 
l=h+lg<k+re<7r1+r2 =r, and by transitivity of < on Q, 1 <r. Thus conditions 
(1) and (2) of Dedekind cuts hold. 


128 Chapter 3: Construction of the number systems 


Conditions (3) and (4) hold similarly. Furthermore, if 1 € LZ; + Lz and x € Q 
with x < 1, then we claim that x € L, + Lo. Namely, write | = 1; + I. for some |; € Lj, 
i = 1,2. By assumption we have that x—I+], < l), and since (11, R;) is a Dedekind cut, by 
Exercise 3.7.1, we conclude that «—/+1, € L,. Thus xa = x—I14l = 27-141, 4l,g € [14+ Lp. 
Similarly, whenever r € R, + Ro and x € Q with r < xz, then x € R, + Ro. 

It remains to prove Condition (5) of Dedekind cuts for (£1 +L2,Ri+R2). Let x € Q 
and suppose that x ¢ (£, + Lz) U(Ri + R2). We will prove that (LZ; + Lz, R; + Re) is the 
Dedekind cut D,. For this we need to prove that any y € Q not in (LZ; + Lz) U(Ri + Ro) 
equals x. By Remark 3.6.1 and Notation 3.6.5 we may write = ™, y = # for some 
m,n,p € Z, with n > 0. By the definition of Dedekind cuts the sets S; = {r ¢ Z: 7 € R;} 
are bounded below, so that S; has a least element a; € Z by Theorem 3.5.12. Thus © € R; 
for all integers c > a;, and + ¢ R; for all integers c < a;. In particular, — € Ry + Ro for 
all integers c > a; + ag. If (a; —1)/n € L; for i = 1,2, then £ € L, + Le for all integers 
c <a, +a2—2, so that necessarily x = y = aifa2—) So say that (a; —1)/n ¢ L,. Then by 
the definition of Dedekind cuts we have that D1 = Da,—1)/n- If also (a2 —1)/n ¢ La, then 
Ly = Diag-1)/n) « © £1 + L2 for all integers c < a, + a2 — 4, aitaa—3 = 201-3 + 242—3 € 


I, + Lo, fates = aot + sot € Ri + Ro, so that x = y= pt The remaining 


case is where Ly = D(g,—1)/n and Le # Diaz-1)/n- If Lo = Dy for some rational number 


r, then by possibly taking a multiple of n we may assume that r is of the form # for 
some integer g, but then we are in the previous case. So we may suppose that (Le, R2) 
is not a rational number. Then 2a2—1 is either in Ry or in Lg. If it is in Ro, then 
ital — 2u1—1 4 242-1 € Ri + Ro, so that necessarily z = y = , and if it is in 
Ly, then aitaa—2 a 2a1—3 + 2a2—1 € [, + Le, so that x = y = aitag— Tt, Thus Condition 
(5) holds. 


Thus we define addition on R by 


a,+ag—2 
n 


(Li, Ri) + (Le, Ro) = (Li + Lo, Ri + Re). 


There are two different meanings to “+” above: on the right side, + acts on two sets of 
rational numbers, and on the left, + stands for the new sum. 
The proof of the following is left as an exercise. 


Theorem 3.7.11. Addition on R, as defined above, is commutative and associative. The 
additive identity is Dy = (Q, Q*), and the additive inverse of any (L, R) € R is (—R, —L) 
(recall Definition 3.7.9). 


Naturally we write the additive identity Do of R as 0, and for any x € R, we write 


its additive inverse as —2. 


Additions on Q and R are compatible, in the following sense: 
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Theorem 3.7.12. For any m,n € Q, Dm + Dn = Dmtn (the first + is in R, the second 
in Q). 


Proof. We know that D,,, + Dy, is a Dedekind cut, and so we can write it as (L, R). 

We first prove that R= {r€Q:r>m-+n}. Let r € R. By definition of addition 
on R,r =rm+Prn for some r, > m and ry, > n. But then by compatibility of addition in 
Q with >, 


T=TTmtTn>Tmt+tn>m+tn, 


and by transitivity, r > m-+n. This proves that RC {r € Q:r >m+n}. Now suppose 
that r € Qwith r > m+n. Set fm = —f— +m andr, = —3— +n. By compatibility of 
< with addition on Q, we know that r—m—n > 0. By Theorem 2.7.11 we know ay 10; 
so that again by compatibility, 2= 1+ 1 > 0. Then by Theorem 2.10.4, —3— > 0, so 
that rm > mandr, >n. Finally, rm+rn = 7, which proves that {r€ Q:r> sea CR 
Thus R= {reQ:r>m+4+n}. 


Then by uniqueness of Dedekind cuts, Dy, + Dn = Dmin- 


Multiplication on R is more complicated, but if one keeps in mind the “expected” 


facts about real numbers, the following definition of multiplication on Dedekind cuts is 
natural: 
(,R-R’), if RNQ'T=RandRAQ = 
Rei',.), RAO =Rand-—L 10" =—L'; 
L L' /\ __ ( o] o] ’ 
en sh y= (L- R _), if R-NQ’ =R' and -LNQ* =- 
C,L-L), if RANQTA Rand FNQ'S RF, 


where in each case _ is the unique subset of Q that makes the result a Dedekind cut. It is 


only possible to find such a _ if the non-blank part on the right side satisfies the conditions 
in Theorem 3.7.8. It is left to Exercise 3.7.9 to verify that. (See Exercise 3.7.11 for an 
alternate formulation of multiplication.) 

Clearly - is a commutative operation, and it is easy to show that D, = ({# € Q:u4< 
1},{a € Q: a > 1}) is a multiplicative identity (and therefore the unique multiplicative 
identity by Theorem 2.5.5). The following are multiplicative inverses for (L,R) # Do: 

(L, R)-1 = { (_,{1/r:re R}), if RN QT = 
; ({1/l:leL},_), otherwise. 
If all elements of R are positive, we are certainly not dividing by 0 in the first case. In the 
second case, by Exercise 3.7.1, L contains only negative elements, and so again we are not 
dividing by 0. It is now straightforward to verify that the listed Dedekind cut is indeed 
the multiplicative inverse. 
I let the reader verify that for all m,n € Q, Dm - Dn = Dmn. 
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Proofs of associativity of multiplication and of distributivity of multiplication over 
addition are more involved. With the definition above, we would have to verify eight cases 
for associativity, and many cases for distributivity, depending on whether the left parts 
of the various Dedekind cuts contain positive elements. Yes, this can be done, but it is 
tedious work. We provide a different proof in this book that is not a brute-force attack but 
is more conceptual and gives side results as a bonus. We first need an intermediate result. 


Theorem 3.7.13. For any Dedekind cuts x,y, (—x)-y = —(a-y) =a: (—y). 


Proof. Write x = (L, R) and y = (L’, R’). 
If L,L’ contain positive elements, then —R,—R’ do not contain positive elements, 
and by definition of multiplication, 


(x-y) + ((-a) -y) = ((L, R)- (L', BF’) + ((-R, -L) - (1, B’)) 
=(4R-B)+ (+R) BR ) 
=(_,R-R)+(-(h- B),_): 
By Theorem 3.7.8, the left side U of (_, R- R’) is either Q\ (R- RB’) or Q\ ((R- BR’) U{m}) 
for some rational number m. Since R- R’ consists of positive numbers strictly greater 
than numbers in Q \ (R- R’) it follows that the left side is contained in the set Q” of all 
negative rational numbers. Furthermore, let r be any negative rational number. We want 
to find € U andy € R- RP’ such that r-y=r. Letx €U andy € R-R’. Write 
r=To/n, © = xo/n and y = yo/n for some integers ro, 20, Yo,n, with n positive. By 
possibly changing x and y but not n, we may assume that xo is the largest possible and yo 
the smallest possible. If (yo—1)/n € U, then yo = ao +1, and y—2 = yo/n—2o9/n = 1/n. If 
instead (yo —1)/n ¢ U, then (yo—1)/n = m, so that (2y9—1)/2n € R-R’, (Qyp—-3)/2n € U, 
and (2yo — 3)/2n — (2yo — 1) /2n = 1/2n. Thus we have found a representation of r as ro/n 
for some positive integer n and some negative integer rp such that for some integers xo, yo, 
xo/n EU, yo/n€ R- R’, and xo/n — yo/n = 1/n. But then (yo + (r—1))/n € R- R’ and 
xo/n — (yo + (79 —1))/n = 1r09/n = r. This proves that the left side of the Dedekind cut in 
the display equals Q”. Thus (xz - y) + ((—2)- y) = Do. By commutativity of addition this 
also gives ((—x) -y) + (x-y) = Do, ((—2) -y) = —(@-y). 
If L contains positive elements but L’ does not, then 


(w-y) + ((—2)-y) = ((L, R)- (L', R’)) + ((-R, -L) - (L', R’)) 
=, (R : Le) ae e —(R ; i), 
and similarly to the previous case, this is Do, so again ((—x)- y) = —(a- y). 


The remaining cases of ((—x) -y) = —(#- y) are proved similarly. Then by com- 
mutativity of multiplication, x - (—y) = (—y)- x, and by the established case this equals 


—(y-x)=—(x-y). 
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Theorem 3.7.14. R is a field properly containing Q. 


Proof. By Theorem 3.7.7, R properly contains Q. For elements of Q, the arithmetic on 


R agrees with the arithmetic on Q: for addition this was done in Theorem 3.7.12, and for 
multiplication the details are left to the reader. 


The only parts left to prove are that multiplication is associative and that it dis- 
tributes over addition. Let x = (L, R), y = (L’, R’), z = (L", R”) be in R. 


To prove that x-(y-z) = (a-y)-2z, it suffices to prove that the additive inverses of the 
two sides are identical. By Theorem 3.7.13, we may then at will in each product replace 
x = (L,R) by —x = (—R,—L), and similarly for y by —y and z by —z. Note that R does 
not contain only positive numbers if and only if —L contains only positive numbers, so via 
these replacements we may assume that R, R’, R” all contain only positive elements. But 
then x: (y-z) =(_,L-(L’- L")), which is by associativity of multiplication on Q the same 
as (_,(L- L’)- L”) = (a- y)- z. This proves associativity of multiplication. 


To prove that x-(y+z) = (a-y)+(ax-z), we may similarly assume that R and R’+ R” 
contain only positive numbers. Then either R’ or R” contains only positive elements, and 
by commutativity of addition we may assume that R’ contains only positive elements. From 
assumptions we have x: (y+ z) =(_,R-(R'+R")). fae R, be RP and ce R", then 
a-(b+c) =a-b+a-c by distributivity in Q, which proves that R-(R’+R”) C R-R'+R-R". 
We want to prove that equality holds in this inclusion. 

Suppose that in addition R” contains only positive numbers. Let r1,r2 € R, r’ € R’, 
rr’ € R". Then ryr’+rer” € R-R'+R-R”, and we want to prove that ryr’+rer” € R-(R’+ 
R"). We claim that (ry-r’+ro-r”)/(r' +r”) > min{ri, re}. If (rier! +re-r”)/(r’ +r") < 11, 
then by compatibility of multiplication with < on Q, ry-r’+ro-r” < ry-r’+ry-r", whence 0 < 
(r1—T2)-r”’, which forces r] > rp. But then (rir’+rer”)/(r’ +r”) > (rer! +rer")/(r' +r") = 
rg, which proves the claim. It follows by Exercise 3.7.1 that (ry -7r’ + r2-r”)/(r’ +r") © R, 
so that the arbitrary element 11-7’ + 1rg-r” = myn erg (! +r",eR, of R-R'+R-R" 
is in R-(R’+ R"). This proves that R- (R'+ R”) = R-R’'+R- R". Thus in this case 
e-(yt+z)=(,R-R+R-R )=(,R-R)+( RR) =r -yta-z. 


Now suppose that R” does not contain only positive numbers. Then —L” contains 
only positive numbers. Thus by the previous case x-(y+z)+a-(—z) = x-((y+z)+(—z)). By 
the already established associativity of addition, this equals x-y. Then by Theorem 3.7.13 


this says that x-(yt+z)=xv-yt+a2-z. 


The Dedekind cuts as in Theorem 3.7.5 are the familiar rational numbers, and if we 


establish some good order on R, we should be able to order the other Dedekind cuts among 


the rational ones as expected, to get that R consists of the familiar real numbers. 
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Exercises for Section 3.7 


3.7.1. Let (L,R) be a Dedekind cut, 1 € L, r € R, and x € Q. Prove the following: 
i) Ifa <J, then z € L. 
ii) Ifa >r,then ze R. 


3.7.2. Prove that there exists no R C Q such that (Q” U {0}, R) is a Dedekind cut. 
Similarly, there exists no L C Q such that (L,Q* U {0}) is a Dedekind cut. 


3.7.3. Let (L,R) be a Dedekind cut. 
i) If m € Qis the supremum of L, prove that R= Q\ (LU {m}). If L does not have 
a supremum in Q, prove that R= Q\ L. 
ii) If m € Q is the infimum of R, prove that L = Q\ (RU {m}). If R does not have 
an infimum in Q, prove that L = Q\ R. 


3.7.4. Let (L, R) bea Dedekind cut and let (L’, R’) = (L, R)-(L, R). Prove that R’ C Qso. 
3.7.5. If m and n are rational numbers, prove that D,, + Dy = Dm+n. 


3.7.6. Leta = ({1 €Q:x2<Oand2? >2h{t#eQ:2?<2orz>0}),b=({f EQ: 
x<Oor 2? <3},{ve€Q:27 >3 and x > 0}). Compute a+b, a-b. 


3.7.7. Prove Theorem 3.7.11. 
3.7.8. Let (11, Ri),...,(Ln, Rn) be Dedekind cuts. Prove that 


(Upai Le, Ma1Re) 
is a Dedekind cut that equals (Lx, Ry.) for some k € {1,...,n}. 


3.7.9. Prove that multiplication on Dedekind cuts is well-defined. 


3.7.10. Let (LZ, R) be a Dedekind cut. Prove that there exists z € Q such that z € R and 
—z € L. Prove in addition that for any a € Q, there exists such z with z > |al. 


3.7.11. For any set S C Q, define Si, = SNQ, and S_=SNQ_. Prove that for any 
Dedekind sets (L, R), (L’, R’), (L, R)- (L’, R’) = (U,V), where 
US SR LE SP ies Dice Be 2g 
V=L_-L_+R_-L,4+L,-R_+Ry-R4, 


and where A+0=AandA-=9. 


3.7.12. The standard Dedekind cut in the literature is defined as the pair (LZ, R) with 
the following properties: 

(1) L,R are non-empty subsets of Q. 

(2) For allie Dandallre R,l<r. 

(3) For alll € L, there exists I’ € L such that | < I’. 

(4) LUR=Q. 
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What should be the definition of the sum of two standard Dedekind cuts? With the 
definition above, why does (Li, Ri) + (Lo, Re) = (Li + Le, Ri + R2) not work? Why does 
—(L, R) = (—R,—L) not work? 
3.7.13. Let x,y € R. 
i) Prove that x? = (—2)?. 
ii) Suppose that 2? = y?. Prove that either 2 = y or x = —y. 


*3.7.14. Let x,y € R such that x? = y?. Prove that x = y. 


3.8 Order on R, the Least upper bound theorem 


Definition 3.8.1. We impose the following order on R: (L,R) < (L’, R’) if L C L’, and 
(L,R) <(L’,R') if LCL’. 


Theorem 3.8.2. R is an ordered field. 


Proof. By Theorem 3.7.14, R is a field. Let (ZL, R), (L’, R’), (L", R”) ER. 

If LZ L’, let le L\ L’. Let l’ € L’. By Exercise 3.7.1 applied to the Dedekind cut 
(L,R),l>VU andl’ € L. This proves that L’ C L, and since L' ¥ L, it follows that L’ ¢ L. 
This proves that at least one of L ¢ L’, L = L’, and L’ C L holds. But no two of these 
conditions can hold simultaneously, so exactly one of the three conditions holds. Hence by 
definition of order on R, exactly one of the following conditions holds: (L,R) < (L’, R’), 
(L, R) = (L’, R’), (L, R) > (L’, R’). This proves trichotomy of <. 

Suppose that (L, R) < (L’, R’) and (L’, R’) < (L", R"). Then by definition, LD ¢ L’ ¢ 
LL”, so that L ¢ L”, which says that (L, R) < (L”, R”). Thus transitivity holds for <. 

Suppose that (L,R) < (L’,R’). This means that L ¢ L’, so certainly LD + L” C 
DT’ +L". If equality holds, then (L, R) + (L", R”) = (L’, R’) + (L", R”). By adding the 
additive inverse of (L”’, R’”) whe then conclude that (L, R) = (L’, R’), which contradicts the 
assumption (L, R) < (L’, R’). Thus necessarily 0+ L" CL’+L", so that (L, R)+(L", R"”) < 
(L’, R') + (L", R”). This proves that < is compatible with addition. 

Finally, suppose that (L,R) < (L’, R’) and that Do < (L”, R”). Then by compati- 
bility of < with addition, Do < (L’, R')—(L, R) = (L', R') + (—R,—L) = (L'— R, R’—L). 
Thus {cf € Q: « < 0} CL’ — R, which means that R’ — L contains only positive 
elements of Q. Thus by definition of multiplication, ((L’,R’) — (L,R))-(L",R") = 
(_,(R’ — L)- R”), and so Do < ((L’, R’) — (L,R))- (£", R”). But then by associativ- 
ity, Do < ((L’, RB’) - (L", R")) — ((£, RB) - (L", R")), so that by compatibility of < with 
addition, (L, R)- (L”, R”) = Do + ((L, R)- (£", R")) < (L’, RB’) - (£", R"), which proves 
that < is compatible with multiplication by positive numbers. 
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Thus the following terms and facts, developed in Sections 2.7 and 2.10, apply to 


elements of R: positive, negative, non-positive, non-negative, intervals, absolute 


value, triangle inequality, reverse triangle inequality. 


The following is a rephrasing of Theorem 2.10.4 in this context: Suppose that a,b € R 
have the property that |a — b| < ¢ for all real numbers € > 0. Then a = b. 


Theorem 3.8.3. (Archimedean property) Let m,n € R. Ifm > 0, there exists p € NT 
such that n < pm. 


Proof. Write m = (L,R), n = (L’,R’). Since m > 0, L contains a positive rational 
number x. Let y € R’. Then by the Archimedean property for Q, Theorem 3.6.8, there 
exists p € Nt such that y < px. Then 
n= (Le) 

< Dy, (since y € R’) 

< Dpz (since y < px) 

= pD, (by multiplication in Q and R) 

<p-(L, R) (since x € L 

and by compatibility of < with multiplication by positive p) 


= pm. 


Theorem 3.8.4. For any two distinct real numbers there is a rational number strictly 
between them. 


Proof. Let a,b € R with a < b. Write a = (L, R) and b = (L’, R’) as Dedekind cuts. Since 
a <b, it means that LD ¢ L’. Let r’ € L’ \ L. By definition of Dedekind cuts, there exists 
r € L’ such that r >r’. Thus LC {x € R: a2 <r}CUL’, so that a < D, < 6, which by 
identification r = D,; proves the theorem. 


One of the main properties that distinguishes R from Q is the following theorem: 


Theorem 3.8.5. (Least upper/greatest lower bound theorem) Let T be a non- 


empty subset of R that is bounded above (resp. below). Then sup(T) (resp. inf(T’)) exists 
in R. 


Proof. First suppose that T is bounded above. Let U = Ur, ryerL. In other words, U is 
the union of all the left parts of all the Dedekind cuts in JT’. Since T is not empty, neither 
is U. Now let (Uo, Vo) € R be an upper bound on T. Then by definition L C Up for all 
(L,R) € T. Thus U C Up. This implies by Theorem 3.7.8 that there exists V C Q be 
such that (U,V) is a Dedekind cut. By definition, (U,V) is an upper bound for T, and this 
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upper bound is less than or equal to an arbitrary upper bound (Up, Vo). This proves that 
sup(T) exists in R. 


The proof for the infimum is left to Exercise 3.8.2. 


By Theorem 2.7.6, sup(7’) and inf(7’) are unique. 


Remark 3.8.6. In contrast, note that the non-empty bounded set {rv € Q: a? < 2} 
has no infimum nor maximum in Q. But this same set, as a subset of R, has infimum 
({c€Q : 2? > 2},_) and supremum (_,{x € Qt: 2? > 2}) inR. 


We use this Least upper bound theorem in the proofs of the Intermediate value 


theorem Theorem 5.3.1, the Extreme value theorem Theorem 5.2.2, and in many limit 
tricks for functions and sequences. Here is the first example of its usage. 


Theorem 3.8.7. The ceiling function makes sense on R, i.e., there exists a function | | : 


R — Z such that for all x € R, [x] > x and there exists no integer n with the property 
ae Sees 


Similarly, there exists the floor function | |: R > Z such that for all x € R, |x| <2 


and there exists no integer n with the property |x| <n <1. 


Furthermore, |x| = —[—<]. 


Proof. Let T be the set of all integers that are greater than or equal to x. By the 
Archimedean principle (Theorem 3.8.3) applied to xz and 1 > 0 there exists a positive integer 
p such that x < p-1. By the Least upper bound theorem (Theorem 3.8.5), the infimum of 
T is areal number. We call it 7. We need to prove that r € T’.. Since x is a lower bound on 
T, necessarily x < r. Since r is the infimum of 7’, then for every positive number ¢€ there 
exists t. € T’ such that 0 < t,—r < €. Suppose that r At). Then0 < t,_,-r <t,-r <1, 
so that by compatibility of < with addition, 0 < ty —t,-, < ti —r < 1, which contradicts 
Theorem 3.1.7. Thus r = t, is an integer. By the least upper bound theorem, x7 < r and r 
is the least integer with this property. Hence r = [a]. 

Now let m = —|[—a]. Then by definition m is an integer with the properties that 
—x < —m and that there is no integer n such that —x <n < —m. Hence by compatibility 
of < with addition, x > m and there is no integer n such that « > —n > m, i.e., there is 
no integer n such that x >n >m. This proves that m = |x| and hence finishes the proof. 
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Theorem 3.8.8. Between any two distinct real numbers there is a rational number 
(strictly between them). 


Proof. Let x,y € R with x < y. Then y — x > 0, and by the Archimedean property, there 
exists a positive integer p such that 2 < p(y—«). Let r = 5 ([Pr] +1) (recall that [pz] 
is the ceiling function of pr). So r is a rational number. By the definition of the ceiling 


1 


function, px < |[px| < [px]+1. Since p is positive, so is p-*, and by compatibility of < with 
multiplication by positive numbers, x7 < 5 [pe] +1) =r. Furthermore, pr+1 < 2+px < py 


by the choice of p, so that [px] +1 < py and finally r = 5 (px | +1)<y. 


Remark 3.8.9. It is also true that between any two real numbers there is an irrational 
number. Namely, let « < y be real numbers. It is proved on page 21 that V/2 is a 
positive irrational number. By compatibility of < with multiplication by positive numbers 
then x/2 < yV2. By the Archimedean property, there is a rational number r such that 
V2 <r < yV2. If r = 0, again by the Archimedean property there exists a rational 
number s such that r/2 < 0 < s < yvV2. So by possibly replacing r by this s we 
may assume that r is a non-zero rational number. Then again by compatibility and by 
Theorem 2.7.11, x < r/V/2 < y. But r is non-zero, so that r/\/2 is an irrational number 
strictly between the given real numbers x and y. 


I recommend that in the first pass through the next theorem the reader works with 
concrete n = 2. 


Theorem 3.8.10. (Radicals exist in R.) Let y be a positive real number and n € NT. 
Then there exists a unique positive real number s such that s" = y. 


Proof. Let T= {r €R:0<r andr” < y}. Then T contains 0, so it is non-empty. If for 
some ré€T,r> [y], thenr > y>r", so that by compatibility of multiplication /division 
by positive numbers, 1 > r. But y is positive, so [y] is bigger than or equal to 1, so that 
1>r> fy] => 1, which by transitivity of > contradicts the trichotomy. So necessarily 
for allr € T, r < [y]. This means that T is bounded above. By the Least Upper Bound 
theorem of R, there exists a real number s that is the least upper bound on T’. Necessarily 
s>OasOeET. 

We claim that 5s” = y. 

Suppose first that s” < y. [WE WANT TO FIND A SMALL POSITIVE NUMBER q SUCH 
THAT (s+q)” < y, WHICH WOULD SAY THAT s+q € T AND THUS GIVE A CONTRADICTION 
TO s BEING AN UPPER BOUND ON T. THE SMALL NUMBER q MIGHT AS WELL BE OF THE 
FORM . FOR SOME (LARGE) POSITIVE INTEGER p. By EXERCISE 1.7.7 WE REALLY WANT 
TO FIND SOME p € N* sucH THAT )7y_ (1)s*/p"—* < y. IN OTHER WORDS, WE NEED 


p € N* sucu THAT pia (i) s*/pr-* <y—s". Bur ee. (s*/p PS Sia (2)s*/p, 
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SO IT SUFFICES TO FIND p SUCH THAT S77 ~, (7)s*/p < y—8”.] By the Archimedean 
property there exists a positive integer p such that Sas ie) s* < p(y—s”) We know that 
7 > 0. Since p > 1, by compatibility of < (and <) with multiplication by the positive 


number - we have that 1 > a Thus 


and by compatibility of < with addition, $77, bee /p"—-* < y. Hence by Exercise 1.7.7, 
(s+1/p)" < y. Since s+ 1/p > s > 0, it follows that s+ 1/p € T, which contradicts the 
fact that s = sup 7’. This proves that s” > y. 

Now suppose that s” > y. Then in particular s > 0. By the Archimedean property 
there exist positive integers p; and p2 such that 1 < p,s and Aa a) s* < po(s"—y). Let 
p = max{pj,po}. Then 1 < ps and oar & s* < p(s” —y). Thus sg = s — 5 is positive, 
and 


Hence by compatibility of < with addition, y < (s— a But then for allr € T, rr" <y < 
(s — a Since s — : is positive, by the increasing property of the power functions on R* 
(Theorem 2.9.2) we have that r < s — : for allr € T. But then s — is an upper bound 
on T’, which contradicts the fact that s is the supremum of T’. 


Then by trichotomy on R we conclude that s” = y. 


Now suppose that t is another positive real number such that t” = y. By trichotomy, 


either t < s or s < t. Then by the increasing property of the power functions on R™, 


t” £ s”, which is a contradiction. This proves that s is unique. 


We call the number s such that s” = y the nth root of y, and we write it as 7/y or 
1/n 
as y/”. 


Exercises for Section 3.8 
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3.8.1. Prove that the following conditions are equivalent on Dedekind cuts (L, R), (L’, R’): 
i) (L, R) < (L’, R’). 
ii) For all / € L there exists I’ € L’ such that 1 <I’. 
iii) For all 1 € L there exists I’ € L’ such that | < I’. (Hint: condition (3) of Dedekind 
cuts.) 
3.8.2. Let T be a non-empty subset of R that is bounded below. Let (U,V) be the 
Dedekind cut with U = — Ur, ryer (—R). Prove that (U,V) is the infimum of T. 


3.8.3. Let m € R be a positive number. Prove that there exists a positive integer N such 
that 1/2% <m. 


3.8.4. Compute the least upper bounds and greatest lower bounds, if they exist, of the 
following subsets of R: 

i) {1/n:n a positive integer}. 

ii) {1/n : n a positive real number}. 

iii) {1/n:n> 4}. 
iv) {(-1)": n € Z}. 
) {2-" : mn € No}. 

vi) {2-"/n:n a positive integer}. 
vii) {n/(n+1):n € No}. 


Vv 


3.8.5. Let n be a positive odd integer and let y be a real number. Prove that there exists 
a real number «x such that x” = y. 


3.8.6. Let S = {)>¢_,(-l)*%:n © N*}. Prove that 9 is bounded above and below. 
3.8.7. Find the least upper bound of {779 gis :m € N*}. (Hint: Example 1.6.4.) 


3.9 Complex numbers 


By Exercise 3.7.4 there is no real number x such that x? = —1. In this section we 
build the smallest possible field containing IR with an element whose square is —1. We 
proceed by defining a new set C with new operations + and -, we verify that the result is 
a field that contains R as a subfield and that has two elements whose squares equal —1. 
It is left to an interested reader to show that there are no fields strictly between R and C 
(that contain a root of —1). 
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Definition 3.9.1. Let C = R x R (the Cartesian product). Elements of C are called 
complex numbers. Define binary operations + and - on C: 


(a,b) + (c,d) = (a+c,b+d), 
(a,b) - (c,d) = (ac — bd, ad + bc). 


We represent complex numbers in the real plane like so: 


The horizontal axis, on which all complex numbers are of the form (r,0), is called 
the real axis, and the vertical axis, on which all complex numbers are of the form (0,7), 
is called the imaginary axis. 

The illustration below shows a geometric interpretation of addition: to add (a,b) and 
(c,d), draw the parallelogram using these two points and (0,0) as three of the four vertices. 
Think through why the sum is the fourth vertex. Because of this picture we loosely say 
that addition in C obeys the parallelogram rule. 


(a+c,b+d) 


We make algebraic sense of multiplication in Notation 3.9.3 and a geometric inter- 
pretation of multiplication is in Theorem 3.12.1. 


Theorem 3.9.2. C is a field. 


However, C is not an ordered field in the sense of Definition 2.7.10, and a rigorous 
proof is left for Exercise 3.9.6. 
Rather than giving a formal proof that C is a field, below is a list of the necessary 
easy verifications. The reader is encouraged to verify all. 
(1) -, + are associative and commutative. 
(2) - distributes over +. 
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(3) For all x € C, (0,0)+a =a. In other words, C has the additive identity 0 = (0,0). 
(The additive identity of a field is written as “0” even when it is an ordered pair 
of real numbers. ) 

(4) For all (a,b) € C, (—a, —b)+(a, b) = (0,0). In other words, every element (a, b) has 
an additive inverse —(a,b) = (—a, —b). By Theorem 2.5.10, the additive inverse is 
unique. 

(5) For all  € C, (1,0)-2 =. In other words, C has the multiplicative identity 
1 = (1,0). (The multiplicative identity of a field is written as “1” even when it is 
an ordered pair of real numbers.) 

(6) (1,0) 4 (0,0). ie., 140. 

(7) Every non-zero element has a multiplicative inverse. Namely, for any (a,b) € (0,0), 
(ape aie) € C and (aie, sar) - (a,b) = (1,0). By Theorem 2.5.10, the 
multiplicative inverse is unique so that (a,b)~! = (aoe san) 

For example, the multiplicative inverse of (1,0) is (1,0), the multiplicative inverse of 
(0,1) is (0,—1), and the multiplicative inverse of (3,5) is (4,-3). 
The squares of the complex numbers (0,1) and (0,—1) are (—1,0) (check!). Thus we 


have found two complex roots of the polynomial function x? + (1,0). An identical proof to 
that of Theorem 2.4.15 shows that these two complex numbers are the only two roots. 


Notation 3.9.3. There is another notation for elements of C that is in some ways better: 
(a,b) = a+bi, with (a,0) = a and (0, b) = bi. This notational convention does not lose any 
information, but it does save a few writing strokes. Addition is easy: (a + bt) + (c+ di) = 
(a+b) + (c+ d)i, the additive inverse of a + bi is —a — bi, the additive identity is 0. This 
notation justifies the possibly strange earlier definition of multiplication in C: 
(a, b) - (c,d) = (a+ bi)(e + di) = ac + adi + bic + bidi = (ac — bd, ad + bc). 

The multiplicative identity is 1 and the multiplicative inverse of a non-zero a+ bi is re 
Definition 3.9.4. The real part of (a,b) is Re(a,b) = a, and the imaginary part is 
Im(a, b) = b. In alternate notation, Re(a+ bi) = a and Im(a-+ bi) = b. Note that both the 


real and the imaginary part of a complex number are real numbers. 


We next identify R as a subset of C: the operative word here is “identify”, 


as R is not a subset of R x R, i.e., real numbers are not equal to ordered pairs of real 


numbers. Nevertheless, with the natural identification of any real number r with the 
complex number (r,0) =r+7-0= r, we can think of R as a subset of C. 

We thus have the following natural inclusions, all compatible with addition and 
multiplication: 


N CZCQCRCEC 
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These number systems progressively contain more numbers and more solutions of more 
equations. For example, the equation 1 + 2 = 0 does not have any solutions in No but it 
does have one in Z; the equation 1 + 2% = 0 does not have any solutions in Z, but it does 
have one in Q; the equation 2 — x? = 0 does not have any solutions in Q, but it does have 


two in R (namely 2 and — V2); the equation 2+ 2? = 0 does not have any solutions in R, 
but it does have two in C (namely /2i and —/2i). Furthermore, the standard quadratic 
formula always yields roots of quadratic equations in C.* 


Exercises for Section 3.9 
3.9.1. Write the following elements in the form a+ bi with a,b € R: 


tae 

ji) tN7=34) 

N) (ii) (342%) ° 

iii) (2 — 32)°. (Hint: Exercise 1.7.7.) 


© a’. What do you observe? 


3.9.2. Let x = v3 + 4 Draw x, 27, x°, x4, 2°, & 

3.9.3. Draw the following sets in C: 

i) {x : the real part of x is 3}. 

ii) {~ : the imaginary part of x is 3}. 

iii) {2 : the product of the real and imaginary parts of x is 3}. 
) 


iv) {x : the product of the real and imaginary parts of x is 0}. 


3.9.4. Let x € C such that x? = —1. Prove that either 2 = i or x = —i by using definitions 
of squares of complex numbers and without invoking Theorem 2.4.15. 


3.9.5. Prove that the only « € C with x? =0 is x = 0. 


3.9.6. Prove that C is not ordered in the sense of Definition 2.7.10. Justify any facts (such 
as that 0 < 1). 


* One of the excellent properties of C is the Fundamental Theorem of Algebra: every polynomial with 
coefficients in C or R has roots in C. The proof of this fact is proved in a junior-level class on complex analysis or 
in a senior-level class on algebra. The theorem does not say how to find the roots, only that they exist. In fact, 
there is another theorem in Galois theory that says that in general it is impossible to find roots of a polynomial 
by using radicals, sums, differences, products, and quotients. 
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3.10 Functions related to complex numbers 
We have established existence and properties of several functions of complex numbers: 


(1) Inclusion of R into C: f(r) =r =r+0-i= (r,0). 
(2) Identity function on C: idc(x) = x. (Say why this function is different from the 


function in part (1).) 
(3) Real part Re: C > R. 
(4) Imaginary part Im: C > R. 
(5) Additive inverse — : C > C. 
(6) Multiplicative inverse er :(C\ {0}) 6 C. 
(7) Addition +:C x CC. 
(8) Multiplication -:C x C>C. 
(9) Scalar multiplication functions: for any z € C, multiplication by z is a function 
with domain and codomain C: f(x) = za. 
There are further obvious functions: 
(10) A function f : C > C is called polynomial if there exist ag, ai,...,a@, € C such 
that for all x € C, f(x) = ag + a,x + agx? +--+ +an2”. 
(11) A function is called rational if it equals a polynomial function divided by a poly- 
nomial function. Examples of rational functions are polynomial functions, as well 


er +i 
xr—31° 


as such functions as eat and 
From now on, when the domain of a function is not given explicitly, we take the 
domain to be the largest possible subset of C on which the function makes sense. (Before 
we took the largest possible subset of R; see Notation 2.4.5.) So, the domain of the last 
two functions in the previous paragraph are C \ {7, —i} and C \ {3:7}. 
There is one more very important and powerful function on C, which at first may 
seem unmotivated: 


Definition 3.10.1. The complex conjugate of a+ hi isa+ bi =a — bi. 


Geometrically, the complex conjugate of a number is the reflection of the number 
across the real axis. 


Theorem 3.10.2. Let x,y € C. Then 
(1) « £0 if and only if & F 0. 
(2) « =@ if and only ifx ER. 
(3) The complex conjugate of the complex conjugate of x equals x. In symbols: & = x. 
(4) oFy=r+y. 
(5) ZY==-¥. 


(6) Ify £0, then (x/y) = 7/9. 
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Proof. Write x = a+ bi and y = c+ di for some a,b,c,d € R. Certainly x = 0 if and only 
if a = b = 0, which holds if and only if = 0. This proves (1). 

Certainly « = % if and only if a+ bi =a — bi, ie., if and only if (a,b) = (a, —b), and 
that holds if and only if b= 0. Thus x =7 if and only if =a €R. This proves (2). 

The following proves (3): 7 = a— bi =a+bi = x. Addition in (4) is straightforward. 


The following proves (5): 


zy = (a+ bi) - (c+ dt) 
= ac — bd + (ad + bc)i 


= ac — bd — (ad + bc)i 
= (a — bi) - (c — di) 
=%-y. 


If y £0, then by (1) and (5), = = (x/y)y = x/y-7J, and so (6) follows. 


Remark 3.10.3. All functions above with domain and codomain equal to C were given 
with some sort of algebraic formulation or description. How else can we represent such 
a function? We certainly cannot give a tabular function formulation since the domain is 
infinite. But we cannot draw such a function either: for the domain we would need to draw 
the two-dimensional real plane, and the same for the codomain, so we would have to draw 
the four-dimensional picture to see it all, and that is something we cannot do. So we need 
to be satisfied with the algebraic or verbal descriptions of functions. 


Exercises for Section 3.10 
3.10.1. Let z,y € C. 
i) Prove that x + % = 2Rez. 

ii) Prove that x —& = 2iImz. 

iii) Prove that x-7¥+%-y = 2Re(Zy) = 2Rex Rey+2ImaImy. 
3.10.2. Let x be a complex number. Prove that x - % is real. 
3.10.3. Let x and y be non-zero complex numbers such that x-y is real. Prove that there 
exists a real number r such that y =r- Z. 


3.10.4. Let 2 be non-zero complex number such that x? is real. Prove that Re()-Im(a) = 
0, i.e., that either Re(x) or Im(z) is zero. 
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3.11 Absolute value in C 


We have seen the absolute value function in ordered fields. The Pythagorean theorem 


in the plane R x R motivates the natural definition of distance in C: 


(a,b) =a+bi 


Definition 3.11.1. The absolute value of a complex number (a,b) = a+ bi is |a + bi| = 
Va? +b? € R. The absolute value is also called the norm or the length. The distance 
between complex numbers x and y is |x — y]. 


Since the absolute value is a real number, this gives a way to partially compare com- 
plex numbers, say by their lengths, or by their real components. But recall Exercise 3.9.6: 
C is not an ordered field. 

The absolute value of (a, 0) = a or (0, a) = ia is |a|; the absolute value of (1,1) = 1+7 
is V2; the absolute value of (1, /2) = 1+i/2 is V3; the absolute value of (1, /3) = 1+iV3 
is /4 = 2, et cetera. 


Theorem 3.11.2. Let x,y € C. Then 
(1) |x| = |. 
(2) «-% = |x|? is a non-negative real number. 
(3) x =0 if and only if |x| = 0. 
(4) |Imz|,|Rea| < |a]. 
(6) dia 40, then a-* = z/\z/?, 
(6) |xy| = lal |yl- 
(7) (Triangle inequality) |x + y| < |x| + ly]. 
(8) (Reverse triangle inequality) |x + y| > ||x| — |y]|. 


Proof. Write x = a+ bi for some a,b € R. Then |z| = Va? +6? = \/a? + (—b)? = [zI, 
which proves (1). Also, x-% = (a+ bi)(a — bi) = a? — (bi)? = a? +b? = |z/?, and this 
is the sum of two non-negative real numbers, and is thus non-negative. This proves (2). 


Furthermore, since R is an ordered field, by Exercise 2.7.9, a”, b? > 0, so by Exercise 2.7.10, 
a? + b? = 0 if and only if a =b=0. This proves (3). 

Since b? > 0, it follows that a? < a? + b?, so that by Corollary 2.9.3, ja] = Va? < 
Va? + b?. This proves that |Rex| < |a|. Similarly |Ima| < |x|. This proves (4). 
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If « £0, then by (3), |x| is a non-zero (real, complex) number, and by (2), %/|a/? is 
the multiplicative inverse of x. This proves (5). 

We could prove (6) with straightforward but laborious algebra by using that x = a+bi 
and z = c+ di for some real numbers a,b, c,d, and expanding the relevant sides, but the 
following proof is better: 


Jxy|’ = (wy)(xy) (by (3)) 
= ryxy (by Theorem 3.10.2) 
= rzyy (by associativity and commutativity of - in C) 
= |z|?|y|? (by (3)) 
= (x|\y|)? (by associativity and commutativity of - in R). 
Now (6) follows by taking square roots of both sides. 
To prove the triangle inequality, we also use the squares of the desired quantities to 
avoid having to write the square root: 
ja + yl? = (w@+y)(@ £y) (by (2)) 
= (x+y)(%#+y) (by Theorem 3.10.2 (2)) 


= cet ryt y+ yy (by algebra) 
2+ xy + ary +|y|* (by Theorem 3.10.2 (2)) 
2+ 2Re(2y) + |y|? (by Exercise 3.10.1 iii)) 


| 
8 


| 
8 


< |a|? + 2) Re(ay)| + |y|? (comparison of real numbers) 
< |x|? + 2|xy] + |yl? (by (4)) 

= |x|? + 2|2||g] + |yl? (by (6)) 

= |x|? + 2|a||y| + |yl? (by (1)) 

= (|2| + |yl)*, 


and since the squaring function is strictly increasing on the set of non-negative real numbers, 
it follows that |a + y| < |a| + |y|. This proves (7), and by Theorem 2.10.3 also (8). 


A consequence of part (6) of this theorem is that if « € C has absolute value greater 
than 1, then positive integer powers of x have increasingly larger absolute values, if |x| < 1, 
then positive integer powers of x have get increasingly smaller than 1, and if |z| = 1, then 
all powers of x have absolute value equal to 1. 

The absolute value allows the definition of bounded sets in C (despite not having an 


order on C): 
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Definition 3.11.3. A subset A of C is bounded if there exists a positive real number M 
such that for all x € A, |a| < M. 


For example, any set with only finitely many elements is bounded: if A = 
{%1,...,%n}, set M = max{|x,|,...,|e,|}+ 1, and then certainly for all x € A, |z| << M. 

The subset Z of C is not bounded. The infinite set {2 € C: |a| = 5} is bounded. 
The set {i” : nm € N*} is bounded. The set {1/n:n € N*} is bounded. The set of complex 
numbers at angle 7/4 from the positive real axis is not bounded. (Draw these sets.) 


Exercises for Section 3.11 


3.11.1. Compute the absolute values of the following complex numbers: 
1,i,£V2,1+4,(1+4)/V2,3 + 4i. 


3.11.2. Draw the following sets in C: 
Tyas el Sh 
ii) {x : |e —2+4| = 3}. 
3.11.3. Let a € C, let B be a positive real number, and let A = {x € C: |x—a| < B}. 
Draw such a set in the complex plane assuming a ¥ 0, and prove that A is a bounded set. 
3.11.4. Let A be a subset of C. Prove that the following statements are equivalent: 
i) A is a bounded set. 
ii) There exist a € C and a positive real number M such that A C B(a, M). 
iii) For all a € C there exists a positive real number MW such that A C B(a, M). 
iv) For all a € R there exists a positive real number M such that A C B(a, MV). 
v) There exist a € R and a positive real number M such that A C B(a, M). 


3.11.5. Let F be either C or an ordered field, so that the absolute value function is defined 
on F. Let S be a subset of F'. Prove that S is bounded if and only if {|s| : s € S} is 
bounded. 
3.11.6. Let A and B be subsets of C. Define A+ B = {a+b:ae€ Aanddbe B}, 
A-B={a-b:a€Aand be B}, and for any ce€ C, let cA = {c-a: a € A}. Compute 
A+ B,A-B, and c- A for the following A, B,c: 

i) A= {1,2,i}, B={-1L i}, cH4. 

ii) A=R*, B=R* c= =1. 
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3.11.7. Let A be a bounded subset of C. 
i) Prove that for any complex number c, {ca : a € A} is bounded. 
ii) Prove that for any complex number c, {a +c: a € A} is bounded. 
iii) Prove that {a? : a € A} is bounded. 
iv) Prove that for any positive integer n, {a” : a € A} is bounded. 
v) Prove that for any polynomial function f, {f(a) : a € A} is bounded. 
3.11.8. Let a,b € C. Suppose that for all real numbers € > 0, Ja — b| < €. Prove that 
a =b. (Hint: Theorem 2.10.4.) 
*3.11.9. (Keep in mind that a square root function on C is yet to be discussed carefully; 
see Exercise 5.4.6.) Discuss correctness/incorrectness issues in the following equalities: 
i) -6 = (V3i)(V/12i) = /—3/—12 = \/(—3)(—12) = V36 = 6. 
ii) (R. Bombelli, 1560, when solving the equation x? = 15z + 4.) 


4= i/2 fay STO: i/2 sa) Ok 
iii) (G. Leibniz, 1675) 1+ /—3+ V1— V—3= V6. 


3.12 Polar coordinates 


So far we have expressed complex numbers with pairs of real numbers either in 
ordered-pair notation (x,y) or in the form x + yi. But a complex number can also be 
uniquely determined from its absolute value and the angle measured counterclockwise from 
the positive real axis to the line connecting (0,0) and (2, y). 


1, angle 7/4, 97/4,... 
radius 2, angles 7, —7, 


radius 2, angles 0, 27,... 


radius 1, angles 
radius 2, angle 57/3, —7/3,... 


Any choice of 6 works for the complex number zero. The angles are measured in 
radians. (While you may say degrees out loud, get into the habit of writing down radians; 
later we will see that radians work better.) The angle is not unique; addition of any integer 
multiple of 27 to it does not change the complex number. 
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For further examples, 1+iv3 is on the unit circle centered at the origin and is at angle 


m/3 counterclockwise from the positive real axis, Leis is on the same unit circle and at 


angle —7/3 counterclockwise from the positive real axis, and alti’ is on the same circle 


and at angle 27/3 counterclockwise from the positive real axis. 


We refer to the entries in the ordered pair (x,y) € R x R = C as Cartesian co- 
ordinates. The coordinates (r,@) consisting of the absolute value r of a complex number 


and its angle 6 (measured counterclockwise from the positive real axis) are referred to as 
polar coordinates. 


Numerical conversions between the two coordinate systems use trigonometry. If we 
know r and @, then x and y are given by: 


x=rcosé, 
y=Tsind, 


and if we know x and y, then r and @ are given by: 


r= VPTP, 

anything, se res Oe 

: if = 0 andy 0; 

5 if x =O and y < 0; 

arctan(y/x) € (—7/2,7/2), if x > 0; 

arctan(y/x) € (7/2,37/2), ifa <0. 

Note that the angle is +7/2 precisely when Rex = 0, that the angle is 0 when z is a 
positive real number, that it is 7 when x is a negative real number, et cetera. Furthermore, 
if the angle is not +7/2, then the tangent of this angle is precisely Ima/ Rez. 

We will show in Chapter 9 that the polar coordinates r,@ determine the complex 
number as re’, but at this point we cannot yet make sense out of this exponentiation. 
Nevertheless, this notation hints at multiplication of complex numbers re’? and se’? as 
resulting in rse*(?+9), confirming that the absolute value of the product is the product of 
the absolute values, and hinting that the angle of the product is the sum of the two angles 
of the numbers. 


We next prove this beautiful fact of how multiplication works geometrically. 


Theorem 3.12.1. (Fun fact) Let z be a complex number in polar coordinates r and 0. 
Define functions M,S,R:C— C as follows: 


M(x) = zx = Multiply x by z, 
S(x) =rx = Stretch x by a factor of r, 
R(x) = Rotate x by angle 6 counterclockwise around (0,0). 
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Then 
M=SoR=> oS, 


or in other words, multiplication by z is the same as stretching by r followed by or preceded 
by rotating by the angle @ counterclockwise. 


Proof. If z = 0 or x = 0, the conclusion is trivial, so we assume that x and z are non-zero. 
By the geometry of rotation, rotation and stretching by a positive real number can be done 
in any order, i.ec., Ro S(a) = R(rz) =rR(z) = So R(x). 

So it suffices to prove that Ro S(x) = M(x) for all x. We first prove this for the 
special cases x = 1 and x = 1, after which we prove it for general x. 

The angle of M(1) = z is @ and its length is r. But (Ro S)(1) = R(r) also has 
length r and angle 6, so that M(1) = (Ro S)(1). 

Write z = (c,d) for some c,d € R. Then M(i) = (—d,c), and we draw a few 


examples: 


The complex number M(i) = (—d,c) has length equal to |(c,d)| =r. The angle between 
(c,d) and (—d,c) is 90°, or 7/2 radians, and more precisely, to get from z = (c,d) to 
M(i) = (—d,c) we have to rotate counterclockwise by 7/2. Thus the angle formed by M (2) 
counterclockwise from the positive real axis is 0+ 7/2. But (Ro S)(i) = R(ri) also has the 
same angle and length as M(z), so that M(z) = (Ro S)(i). 

Now let x be general in C. Write s = a+ i for some a,b € R. By the geometry of 
rotation, R(a + bi) = R(a) + R(bi) = aR(1) + R(t). Then 


Ro S(x) = So R(x) (as established from geometry) 
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r(aR(1) + bR(i)) 

ar R(1) + br R(t) 

aS(R(1)) + bS(R()) 

aM (1) + bM(i) (by previously proved cases) 


= azl+ bz 
= z(a+ bi) 
= M(x). 


Theorem 3.12.2. For any non-zero complex number x and any integer n, the angle of x” 
counterclockwise away from the positive x-axis is n times the angle of x. Also, |x”| = |x|". 
Proof. If n = 1, this is trivially true. Now suppose that the theorem is true for some 
positive integer n. Then the angle of x”~! counterclockwise away from the positive x-axis 
is n — 1 times the angle of x, and by Theorem 3.12.1, the angle of x” = rx”! is the sum 
of the angles of x and x”~!, so that it is n times the angle of x. Similarly, by part (6) of 
Theorem 3.11.2, |2”| = eae NS = |a| ee = == gellar = lal, 

Thus by induction the theorem is proved for all positive n. 

Still keep n positive. Since 1 = 2~"x” has angle 0 and x” has angle n times the 
angle of x, by Theorem 3.12.1, x~” must have angle —n times the angle of x. Also, by 
part (6) of Theorem 3.11.2, 1 = |2~"| |x") = |a7™| |x|", so that |a—”| = |x|—". Thus the 
theorem holds for all non-zero n. 

Finally, if n = 0, then then angle of x” = 1 is 0, which is 0 times the angle of x, and 


[2°] = [1| =1=|2). 


For example, 1+ v3i 


is on the unit circle at angle 27/3 counterclockwise from the 


=i is on the 


positive x-axis (i.e., at angle 120° in degrees), and so the second power of 
unit circle at angle 47/3 counterclockwise from the positive x-axis, and the cube power is 
on the unit circle at angle 27, i.e., at angle 0, so that (=1tv31)3 a 

Corollary 3.12.3. Let n be a positive integer. Let A be the set of all complex numbers 
on the unit circle at angles 0, 2n 22, 322, we (Nn 1) 22 . Then A equals the set of all the 


complex number solutions to the equation x" = 1. 


Proof. Let a € A. By the previous theorem, a” has length 1 and angle an integer multiple 
of 27, so that a” = 1. If b € C satisfies b” = 1, then |b|” = |b”| = 1, so that the non- 
negative real number |b| equals 1. Thus 6 is on the unit circle. If @ is its angle, then the 
angle of 6” = 1 is by the previous theorem equal to n@, so that n@ must be an integer 
multiple of 27. It follows that @ is an integer multiple of 2m but all those angles appear for 
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the elements of A. Thus every element of A is a root, and every root is an element of A, 


which proves the corollary. 


Theorem 3.12.4. Let x be a non-zero complex number and let n be a positive integer. 
Then there exist exactly n complex numbers whose nth power equals x. 


Proof. By Theorem 3.11.2 we know that r = |z| is positive. By Theorem 3.8.10 there 
exists a positive real number s such that s” = r. Let a be the angle of x in radians 
measured counterclockwise from the positive real axis. For any positive integer j let u; be 
the complex number on the unit circle whose angle from the positive real axis is (a+277)/n. 
By the previous theorem, wu’? is on the unit circle at angle a + 277 counterclockwise from 
the positive real axis. But this is the same as the complex number on the unit circle at 
angle a counterclockwise from the positive real axis. Hence (su;)" = su? = ru’ is the 
complex number on the circle of radius r at angle a measured counterclockwise from the 
positive real axis. This says that (su,;)" = x. By angle considerations, suj, sug,..., SUn 
are distinct. This proves that there exist n complex numbers whose nth power equals x. 

Now let y be any complex number whose nth power equals x. By the previous 
theorem, |y| = s. Let @ be the angle of y measured counterclockwise from the positive real 
axis. Since y” = x, by the previous theorem, nG — a = 27k for some integer k. Hence 
6 = (a+ 2rk)/n. We can write k = k’n +7 for some integer j € {1,2,...,n} and some 
integer k’. Then 

B =(a+t 2rk)/n = (a+ 2rk1)/n + Qrk’, 


so that the angle ond the length of y are the same as those of su,;, so that y = su;. Thus 


there exist exactly n complex numbers whose powers equal x. 


Whereas for non-negative real numbers we choose its non-negative square root as 
the nth root, there is no natural choice for an nth root of a complex number; more on that 
is in the chapter on continuity in Exercise 5.4.6. 


Exercises for Section 3.12 


3.12.1. Write each complex number in the form a+ bi with a,b € R: 
i) Complex number of length 1 and at angle 7/4 measured counterclockwise from the 
positive real axis. 
ii) Complex number of length 1 and at angle —7/4 measured counterclockwise from 
the positive real axis. 


iii) The product and the sum of the numbers from the previous two parts. 
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3.12.2. Draw the following points in the real plane, and think about Theorem 3.12.1: 
i) 3 — 27, 7(3 — 22), 
ii) —2 —i,1(—2 —7). 
iii) 2 — 37, (1 + 2)(2 — 32). 
iv) 1—2,(1+7)(1 —-2). 
3.12.3. Draw in C = R x R the set 


{x : the angle of x counterclockwise from the positive real axis is 7/3}. 


3.12.4. Prove that for any non-zero z € C there exist exactly two elements in C whose 
square equals z. (Hint: Theorem 3.8.10 and Theorem 3.12.1.) 


3.12.5. Let z be non-zero in C with polar coordinates r and 6 and let n € N*. For and 


1/n and whose angle 


integer k, let z, be the complex number whose absolute value equals r 
measured counterclockwise from the positive x axis is k0/n. 

i) Prove that z, is uniquely determined. 

ii) Prove that for all k, z = z. (Hint: Theorem 3.12.1.) 


iii) Prove that there the set {z, : k € N} contains exactly n elements. 


3.12.6. “Square” the pentagon drawn below. Namely, estimate the coordinates (real, 
imaginary or length, angle) of various points on the pentagon, square the point, and draw 
its image on a different real plane. You need to plot the image not only of the five vertices, 
but of several representative points from each side to see how the squaring curves the edges. 
(Hint: You may want to use Theorem 3.12.1.) 
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3.13 Topology on the fields of real and complex numbers 


When reading this section, absorb the following main points of topology: open ball, 
open set, limit point, closed set. The main object of this section is to introduce limit points 
of sets so that we can in subsequent chapters talk about limits of functions, sequences, and 
series. 

By a topology on a set we mean that some sets are declared open, subject to the 
conditions that the empty set and the whole set have to be open, that arbitrary unions of 
open sets be open, and that finite intersections of open sets be open. In any topology, the 
complement of an open set is called closed, but a set may be neither open nor closed. A 
topology can be imposed on any set, not just R or C, but we focus on these two cases, and 
in fact we work only with the “standard”, or “Euclidean” topology. 


Definition 3.13.1. Let F be R or C. Let a € F and let r be a positive real number. An 
open ball with center a and radius r is a set of the form 
Bia,r)={xeEF:|x—-al <r}. 
An open set in F is any set that can be written as a union of open balls. 
The following are both B(0,1), but the left one is a ball in R and the right one is a 


ball in C. Note that they are different: by definition the left set is an open subset of R, 
but if you think of it as a subset of C, it is not open (see Exercise 3.13.1). 


Examples 3.13.2. 
(1) B(a,r) is open. 
(2) F = UaerB(a,1) is an open set. 
(3) The empty set is an open set because it is vacuously a union of open sets (see 
page 52). 


(4) For real numbers a < 6, the interval in R of the form (a,b) is an open set in R 
because it is equal to B((a + b)/2, (b—a)/2). The interval (a, oo) is open because 
it equals U%_, B(a +n, 1). 

(5) The set A= {2 € C:1< Rez < 3 and 0.5 < Imz < 2} is open in C. Namely, 
this set is the union Uge4 B(a, min{Rea — 1,3 — Rea, Ima — 0.5,2 — Ima})). 
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(6) The set A = {2 € C: Rex < 1 and Imz < 2} is open in C. Namely, this set is 
the union Uge 4B(a, min{1 — Rea, 2 — Ima}). 


Theorem 3.13.3. B(a,r) has infinitely many points. 


Proof. For each integer n > 2, a+r/n € B(a,r). Since r > 0, these numbers are all 


distinct. Since No is infinite, so is the set of all integers that are at least 2. 


Example 3.13.4. Thus if A is an open subset of F’, then either A is empty or A has 
infinitely many points. In particular, {a} is not open. 


Theorem 3.13.5. Let A be an open set and let a € A. Then there exists r > 0 such that 
Bia,r) CA. 


Proof. Since A is open, it is a union of open balls. Thus a is an element of one such ball 
B(b,s), with B(b,s) C A. 

Since a € B(b,s), we have that ja — b| < s, so that r = 
s — |a—)| is a positive real number. (In the illustration, this is the fo a, 
distance between a and the outside of the circle.) We claim that : eee . 
B(a,r) € B(b,s). To prove this, let z € B(a,r). Then |x —b| = + ? 
|x -a+a—b)| < |x—a|+|a—b)| by the triangle inequality, and since 
|x —a| <r = s—|a—D|, it means that |x—b| < s, so that x € B(b,s). 
This proves the claim, and hence it proves that B(a,r) C A. 


Theorem 3.13.6. (Topology on F’) 
(1) 0 and F are open. 
(2) Arbitrary unions of open sets are open. 
(3) Finite intersections of open sets are open. 


Proof. The empty set can be written as an empty union of open balls, so it is open 
vacuously, and F = Uger B(a,1), so that F' is open. This proves (1). 

Every open set is a union of open balls, and so the union of open sets is a union of 
open balls, hence open. This proves (2). 

Now let Aj,...,A, be open sets. Let a € Ay N---M Ay. By Theorem 3.13.5, for 
each k = 1,...,n, there exists r, > 0 such that B(a,r;,) C Az. Set r = min{ryi,..., rn}. 
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Then B(a,r) C NR_, Bla, rz) C NR_, Ax. Thus for each a € N?_, Ax there exists rz > 0 
such that B(a,ra) C Mp_, Ak. It follows that 


Np=1Ak = Uaenr_, A, B(a, Ta): 


This proves that N_, Ax is a union of open balls, so it is open. 
k=1 


An arbitrary intersection of open sets need not be open. For example, N°@, B(a,1/n) = 
{a} is not open. 


Definition 3.13.7. Let A be an arbitrary subset of F and a € F (not necessarily in A). 
We say that a is a limit point of A if for all real numbers s > 0, B(a, s) contains elements 
of A different from a. 


Examples 3.13.8. 
(1) If A = {a}, then the set of limit points of A is the empty set. 
(2) If A=Q, then the set of limit points of A is R. 
(3) The set of limit points of @ is empty, the set of limit points of R is R, and the set 


of limit points of C is C. 
(4) The set of limit points of B(a,r) equals {x € F': |x —a| <r}. 


Definition 3.13.9. A set A is a closed set if it contains all of its limit points. 
Theorem 3.13.10. A is open if and only if F \ A is closed. 


Proof. Suppose that A is open. Let x € A. By Theorem 3.13.5, there exists r > 0 such 
that B(a,r) C A. Thus B(x,r) 9 (F'\ A) = 0, so that x is not a limit point of F \ A. Thus 
no point of A is a limit point of F \ A, which proves that any limit points of F’ \ A are in 
F\ A. Thus F \ A is closed. 

Now suppose that F' \ A is closed. Let « € A. Since F' \ A contains all of its limit 
points, then x is not a limit point of Ff \ A. Thus by the definition of limit points, there 
exists r > 0 such that B(x,r) (F \ A) is empty. This means that B(x,r) C A. Thus 
A= Ure AaB(x,rx) for appropriate r, > 0, so that A is open. 


The following is now almost immediate from previous results: 
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Theorem 3.13.11. (Topology on F’) 
(1) 0, F are closed sets. 
(2) Arbitrary intersections of closed sets are closed. 


(3) Finite unions of closed sets are closed. 


Proof. Exercise 2.1.8 proves that the union of the complements of two sets equals the 
complement of the intersection and that the it was proved that the intersection of the 
complements of two sets equals the complement of the union, and an equally easily proved 
mathematical truth is the following generalization to possibly many more sets: 
F\(J4An=(\(F\ 4x), F\() Ae = U(F \ An). 
kel kel kel kel 


With this, (2) and (3) follow from the last two theorems, and the proof of (1) is trivial. 


Both @ and F are open and closed, and these turn out to be the only sets that are 
both open and closed (see Exercise 3.13.2). Some sets are neither open nor closed (see 
Exercise 3.13.1). 


Remark 3.13.12. (This remark puts Theorems 3.13.6 and 3.13.11 in the more general 
context and is not needed in the first course in analysis.) Any set F (not necessarily a 
field) is a topological space if there exists a collection J of subsets of F’ such that the 
following properties are satisfied: 

(1) 0,F € 7, 

(2) Arbitrary unions of elements in J are in J. 

(3) Finite intersections of elements in J are in J. 
Elements of J are called open. Subsets of F' that are complements of open sets are called 
closed. The proof of Theorem 3.13.11 for closed sets in this topological space are proved 


in the same way. 


Exercises for Section 3.13 


3.13.1. Let A be the open ball in R of radius 1 and centered at 0. Since R is a subset of 
C, then A is also a subset of C. Prove that A is neither a closed nor an open subset of C. 


3.13.2. Let F be either R or C, and let A be a subset of F' that is both closed and open. 
and let A be a closed and open subset of F’. Prove that A=@ or A= F. 


3.13.3. Prove that Q is neither an open nor a closed subset of R or C. 
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3.13.4. Sketch the following subsets of C. Determine their sets of limit points, and whether 
the sets are open, closed, or neither: {2 € C: Ima =0 and 0 < Reg < 1}, {wm € C: Imax = 
Oand0 < Rea < 1}, {a € C: -2 < Ima < 2and0 < Reg < 1}, {fn €C:2<Imze< 
—2 and 0 < Reg < 1}, {1/n:neEN*}. 


3.13.5. For each of the following intervals as subsets of IR, determine the set of limit points, 


and whether the set is open, closed, or neither in R: 
(0, 1), [0, 1], [3, 5), [8, co). 


3.13.6. Let a be a limit point of a set A. Suppose that a set B contains A. Prove that a 
is a limit point of B. 


3.13.7. Give examples of sets A C B C C and a € C such that a is a limit point of B but 
not of A. 


3.13.8. Let A be a subset of C all of whose elements are real numbers. Prove that every 
limit point of A is a real number. 


3.14 The Heine-Borel theorem 


Closed and bounded sets in C and R have many excellent properties — we will for 
example see in Section 5.3 that when a good (say continuous) real-valued function has a 
closed and bounded domain, then that function achieves a maximum and minimum value, 
et cetera. The concept of uniform continuity (introduced in Section 5.5) needs the fairly 
technical Heine-Borel theorems proved in this section. 


Construction 3.14.1. (Halving closed and bounded subsets of R and quartering closed 
and bounded subsets of C) Let A be a bounded subset of of R or of C, and let P be 
a property that applies to some subsets of A. Boundedness of A guarantees that A fits 


inside a closed bounded rectangle Ro of the form (ao, bo) X (co, do) in C, with co = 0 = do 
if A is a subset of R. The rectangle can be halved lengthwise and crosswise to get four 
equal closed subrectangles. In the next iteration we pick, if possible, one of these four 
closed quarter subrectangles such that its intersection with A has property P. We call this 
subrectangle R,. If A is a subset of R, then the length of R, is half the length of Ro, and 
otherwise the area of R; is a quarter of the area of Ro. In general, once we have R,,, we 
similarly pick a subrectangle R,41 such that R,4,9A has property P and such that the 
sides of Ry+1 are half the lengths of the sides in R,. Write R, = [an, bn] x [en, dn] for some 
real numbers a, < b, and cy, < dy. By construction, for all n, b, — an = (bo — ag) /2”, and 


Go Sa) Sag <°°+S an <-°- Sdn) < bg < D1 < Do. 
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This means that {a1,d@2,a3,...} is a non-empty subset of R that is bounded above, so 


that by the Least upper bound theorem (Theorem 3.8.5), a = sup{ai, d2,a3,...} is a real 
number. Similarly, b = inf{b1, bz, b3,...} is areal number. Since a < 6, b2, b3,..., it follows 
that a < b. Suppose that a < b. Then by Exercise 3.8.3, there exists a positive integer N 
such that 1/2" < (b—a)/(b9 — ao). But an <a <b < by, so that 0<b-—a<by—ay = 
(by — ao) /2% <b—a, which contradicts trichotomy. Thus a = 8, i.e., we just proved that 


sup{a@1, a2, a3,...} = inf {by, ba, b3,...}. 


Similarly, 


c = sup{c1, C2, ¢3,...} = inf{dj, do, d3,...}. 


This means that the intersection of all the R,, equals the set {a+ ci}, consisting of exactly 
one complex number. By the shrinking property of the subrectangles, for every 6 > 0 there 
exists a positive integer N such that RyN AC B(a+t ci,d). 

In particular, “quartering” of the closed and bounded region (interval) A = 
[ao, 69] C +R, means halving the rectangle (interval), and the intersection of all the chosen 
closed half-rectangles is a set with exactly one element. That element is in A, so a real 


number. 


Theorem 3.14.2. (The Heine-Borel theorem (in R, C)) Let A be a closed and 
bounded subset of R or C. For each c € A let 6, be a positive number. Then there exists 
a finite subset S of A such that A C Uces B(c, 5c). 


Proof. We declare that a subset B of A satisfies (property) P if there exists a finite subset 
S of B such that B C Uces B(c, 6). We want to prove that A has P. 

Suppose for contradiction that A does not have P. Since A is closed and bounded, it 
fits inside a closed rectangle Ry. With Construction 3.14.1, we construct iteratively nested 
subrectangles Rg D Ry D Rog D---. The quarter subrectangles are chosen so that each 


Ry, OA does not have P. This is true ifn = 0 by assumption. Suppose that R, has been 
chosen so that R,,A does not have P. If the intersection with A of each of the four quarter 
subrectangles of R,, has P, i.e., if each of the four subrectangles (as in Construction 3.14.1) 
intersected with A is contained in the union of finitely many balls B(c,6.), then R,M A is 
covered by finitely many such balls as well, which contradicts the assumption on R,,. Thus 
it is possible to choose R,p+1 so that Ryz+19A does not have P. By construction, N°, Rn 
contains exactly one point. Let that point be x. Since each R, A has infinitely many 
points, x is a limit point of A, and since A is closed, necessarily x € A. 

By the shrinking sizes of the R,, there exists a positive integer N such that R, C 
B(az, 6). But then R, A has P, which contradicts the construction. Thus A has P. 
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Remark 3.14.3. Let F be R or C and let A be a closed and bounded subset of F’. Let T 
be a collection of open subsets of F’ such that A C UyerU. This set containment is usually 
referred to as T being an open cover of A. By the definition of set containment, for each 
c € A there exists U. € T such that c € U,. Since U, is open, by Theorem 3.13.5, there 
exists 6. > 0 such that B(c,d.) GC U-. The Heine-Borel theorem Theorem 3.14.2 asserts 
that there exists a finite subset S of A such that A C UcesB(c, 6.)U-U-. In other words, the 
Heine-Borel theorem asserts that every open cover of a closed and bounded subset 
of C has a finite subcover. 


Remark 3.14.4. In the more general context of topological spaces as in Remark 3.13.12, 
there need not be a notion of bounded sets. A set for which every open cover has a finite 
subcover is called compact. So Theorem 3.14.2 proves that every closed and bounded 
subset of C is compact. 


Theorem 3.14.5. Let A be a closed and bounded subset of R or C, and for each a € A 
let 6g be a positive number. Then there exist a finite subset S of A and a positive real 
number 6 such that A C UcesB(c,6-) and such that for all x € A there exists c € S' such 
that B(x,6) C Bic, 6). 


Proof. By Theorem 3.14.2, there exists a finite subset S of A such that A C Uces B(c, 6-/2). 
Let 6 = 5 min{d. :c € S}. Since S is a finite set, 6 is a positive real number. 

Let x € A. By the choice of S there exists c € S such that x € Bi(c,6,/2). Let 
y € B(a, 6). Then 


ly—cl=|ly-—a+a—cl <ly—a)+la—cl<64+6./2 < b, 


so that y € B(c,4-). It follows that B(x,5d) C Bic, 6¢). 


Exercises for Section 3.14 


3.14.1. Let A= {a €R:2 <2 < 3}. Prove that A C Use, B(a,1/a). Does there exist a 
finite set S of A such that A C UgesB(a,1/a). If yes, find it, if no, explain why not. 
3.14.2. Let A= {1/n:n€N*} CR. Prove that A C UgeaB(a,a/3). Does there exist a 
finite set S of A such that A C UgesB(a,a/3). Repeat with A = N*. 

3.14.3. Let A be a closed and bounded subset of R or C. Let J be a set and for each 
k € I let Uy, be an open subset of F’. Suppose that A C Uner U;,. Prove that there exists a 
finite subset K of J such that A C Une x Ux. (Hint: Prove that for each a € A there exists 
da > 0 such that B(a,d,) is in some U;. Apply Theorem 3.14.2.) 


Chapter 4: Limits of functions 


Limits are a foundation of analysis. They are an important mathematical concept 
and they lend themselves nicely to practicing proofs. 

Section 4.1 contains the formal definition together with informal intuitive pictures 
and explicit examples of limit proofs. Many of the examples are worked out in the longer 
this-is-how-we-think version as well as in the shorter this-is-how-we-write version. The for- 
mal definition is referred to as the “epsilon-delta” definition for obvious reasons. Section 4.2 
is a lesson in logical negation on what it means for a number to not be a limit. Section 4.3 
looks at the epsilon-delta definition of limits more finely: the order and importance of 
the quantifiers matter, and small modifications can change the meaning significantly. The 
lesson of the section is that it is important to remember any statement precisely. The 
epsilon-delta proofs tend to be time-consuming, so Section 4.4 proves alternative theorems 
that shortcut that work for many nice functions. But for many limits an epsilon-delta ar- 
gument is the only possible proof, so it is important to master the method. In the first five 
sections all is happening in subsets of C, Section 4.5 transitions to limits for real-valued 
functions being plus or minus infinity, and Section 4.6 for functions whose domain is a 
subset of R allows finite or infinite limits to be taken at +too. 


4.1 Limit of a function 


All calculus classes teach about limits, but the domains there are typically intervals 


in R. Here we learn a more general definition for (more interesting) domains in C. 


Definition 4.1.1. Let A be a subset of C and let f : A— C bea function. Suppose that 
a complex number a is a limit point of A (see Definition 3.13.7). The limit of f(x) as x 
approaches a is the complex number L if for every real number ¢ > 0 there exists a 
real number 6 > 0 such that for all x € A, if 0 < |x —al| < 6, then |f(x) — L| <.e. 

When this is the case, we write lim f(x) = L. Alternatively, in order to not make 


lines crowded with subscripts, we write lim, 44 f(x) = L. 


It is important to note that we are not asking for f(a). For one thing, a may or may 
not be in the domain of f, we only know that a is a limit point of the domain of f. We 
are asking for the behavior of the function f at points near a. 
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We can give a simple geometric picture of this in case the domain and codomain 


are subsets of R (refer to Remark 3.10.3 for why we cannot draw functions when domains 


and codomains are subsets of C). Below are three graphs of real-valued functions defined 
on a subset of R and with a being a limit point of the domain. In each, on the graph of 
y = f(x) we cover the vertical line x = a, and with that information, we conclude that 
lings asf (an) She 


The function f from the first graph above might be any of the following: 


Intuitively we are hoping that f(x) for x near a can predict a trend for the value of 
f as we get arbitrarily close to a. For example, we may not be able to bring x to 0 Kelvin, 
but if we can take measurements f(x) for x getting colder and colder, perhaps we can 
predict what may happen at 0 Kelvin. But how believable is our prediction? Perhaps for 
our theory to be satisfactory, we need to run experiments at temperatures x that give us 
f(x) within « = 10 of the predicted value. Or when instruments get better, perhaps € gets 
smaller, say one thousandth. Or a new material is discovered which allows even smaller 
e«. But no matter what € is determined ahead of time, for the prediction to be believable, 
we need to determine a fixed range of x, within a 6 of a but not equal to a, for which the 
f-values are within the given € of the prediction. 


A graphical way of representing the epsilon-delta definition of limits for real-valued 
functions with domains in R is as follows: For every positive € there exists a positive 6 such 
that for all 2 in the domain with 0 < |x —a| <6 (the x £ a in the vertical gray band), the 
value of f(x) is within € of L (in the horizontal grey band): 


162 Chapter 4: Limits of functions 


L+e 
L pee ee, eee 
L-e 


a—-d@a+6 


If € gets smaller, 6 has to get smaller too; but we may keep the old 6 for larger e. 

While these pictures can help our intuition, they do not constitute a proof: the 
definition is an algebraic formulation, and as such it requires algebraic proofs. In the rest 
of the section we examine many examples algebraically, with the goal of mastering the 
epsilon-delta proofs. But epsilon-delta proofs are time-consuming, so in the future we will 
want to replace them with some shortcuts. We will have to prove that those shortcuts 
are logically correct, and the proofs will require mastering abstract epsilon-delta proofs. 
Naturally, before we can master abstract epsilon-delta proofs, we need to be comfortable 
with epsilon-delta proofs on concrete examples. In short, in order to be able to avoid 
epsilon-delta proofs, we have to master them. (Ha!) 


Example 4.1.2. lim (4% — 5) = 7. 


L383 


Proof. The function that takes x to 4% — 5 is a polynomial function, so it is defined for 
all complex numbers. Thus the domain of the function is C and 3 is a limit point of the 
domain. Let « > 0. [WE ARE PROVING THAT FOR ALL REAL NUMBERS € > 0 SOMETHING- 
OR-OTHER HOLDS. RECALL THAT ALL PROOFS OF THIS FORM START WITH “LET € BE 
AN ARBITRARY POSITIVE REAL NUMBER,” OR ABBREVIATED AS WE DID. NOW WE HAVE 
TO PROVE THAT THE SOMETHING-OR-OTHER HOLDS. BUT THIS SOMETHING-OR-OTHER 
CLAIMS THAT THERE EXISTS A REAL NUMBER 0 > 0 WITH A CERTAIN PROPERTY. THUS 
WE HAVE TO CONSTRUCT SUCH A 6. IN THIS FIRST EXAMPLE I SIMPLY PRESENT A 0 
THAT WORKS, BUT IN SUBSEQUENT EXAMPLES I SHOW HOW TO FIND A WORKING 0. IN 
GENERAL, THE PROOF SHOULD CONTAIN A SPECIFICATION OF 6 AND A DEMONSTRATION 
THAT IT WORKS, BUT HOW ONE FINDS THAT 6 IS IN GENERAL LEFT TO SCRATCH WORK 
OR TO INSPIRATION. | Set 6 = €/4. [NEVER MIND HOW THIS MAGIC €/4 APPEARS HERE; 
WAIT UNTIL THE NEXT EXAMPLE WHERE I PRESENT A PROCESS OF FINDING 6.] Then 
6 is a positive real number. [SURE! — NOW WE HAVE TO PROVE THAT FOR ALL 2, IF 
0 < |a — 3] < 6, THEN |f(x) — L| < «. THE PROOF OF “FOR ALL & ...” STARTS WITH:] 
Let x be an arbitrary complex number. |FOR THIS 2 WE NOW HAVE TO PROVE THAT IF 
0 < |x — 3] < 6, THEN |f(x) — L| < «. THE PROOF OF “IF P THEN Q” STARTS WITH 
“ASSUME P.”] Assume that 0 < |x — 3| < 6. [NOW WE HAVE TO PROVE Q, I.E., WE 
HAVE TO PROVE THAT |f(a) — L| = |(4z — 5) —7| < €. WE DO NOT SIMPLY WRITE 
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“(4a — 5) —7| < €&” BECAUSE WE DO NOT KNOW THAT YET. WE WRITE THE LEFT SIDE 
OF THIS INEQUALITY, AND MANIPULATE IT — ALGEBRAICALLY, OFTEN WITH TRIANGLE 
INEQUALITIES AND SEVERAL STEPS, UNTIL WE GET <e. ] Then 
|(4z — 5) — 7| = |4a — 12| 
= Alr —3| 
< 46 


= 45 [WASN’T €/4 A CLEVER CHOICE OF 07] 


= €. 


The commentary in the proof above is describing the thought process behind the 

proof but need not and should not be written out in homework solutions. Below is a 
homework-style solution: 
Proof of Example 4.1.2 without the commentary: The function that takes x to 4x” — 5 is 
a polynomial function, so it is defined for all complex numbers. Thus the domain of the 
function is C and 3 is a limit point of the domain. Let « > 0. Set 6 = €/4. Then d isa 
positive real number. Let x be an arbitrary complex number. Assume that 0 < |x—3] < 6. 
Then 


|(4a —5) — 7| = |4a — 12| 
= Alr -3| 
< 46 
€ 
42 
4 


= €. 


In the previous example 6 appeared magically as €/4, and it happened to be a positive 
number depending on € that made the limit proof work. Other numbers would have worked 
as well, such as €/5, or €/(10000+ €), and so on. For good style choose simple formulations 
over complicated ones. If some positive number works as 6, so does any smaller positive 
number, so there is no smallest possible 6 and there are many different correct choices of 6. 
It is not necessary to find the largest possible 6 for a given ¢€, and it is not even necessary 
to show how you derived or chose your 6. However, it is necessary to show that your pick 
of 6 does satisfy the rest of the defining property of limit. 

Think through the comments in the previous paragraph on the example lim,_,4 5 = 5: 
any positive number works for 0. 

In further examples we show all the necessary work for proofs of limits including how 
to determine the 6. It is standard to use the proof-writing trick of partial filling-in: when 
by the definition of limit it is time in the proof to declare what 6 is, we typically write “Set 
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= ”. and in subsequent reasoning we fill in the underlined blank 


with whatever restrictions seem necessary. Study the proofs below for how this is done, 
but let me just say that each restriction puts an upper bound on 6, say bound 6 above 
by 1 and by €?/4, and this is accomplished by filling in “Set 6 = min{1, «2/4 ae 
with room for possible further restrictions. (In Example 4.1.2 we only had one restriction, 
namely 6 < €/4, so that 6 = min{e/4} = €/4.) 

I find that the following two goals make these proofs more concrete and doable: 


e Goal #1: Write | f(x) — L| as less than or equal to (something) times |x — al, 
If f is a rational function, then |x — a| better be a factor of | f(a) — L|! 

e Goal #2: Find a positive constant upper bound B on the (something) found in 
Goal #1. Then you make sure that 6 < €/B. 
You may want to keep these guidelines in mind as you read the proofs below. 


Example 4.1.3. lim, (400° — 5x +2) = 28. 
L—-— 


Proof. The function that takes x to 4x? — 52 + 2 is a polynomial function and it is 
defined for all complex numbers. Thus —2 is a limit point of the domain. Let € > 0. 
Set 6 = [5 TO BE DETERMINED STILL; THE FINAL WRITE-UP WILL 
HAVE THIS FILLED IN, BUT WE DO NOT YET KNOW 0.]. Then 6 is a positive real number. 
(HOPING, ANYWAy.] Let « be any complex number. Suppose that 0 < |a + 2| < 6. Then 


|(4a? — 5a +2) — 28| = |4n? — 5a — 26| 
[WANT GOAL #1: «-a=2%-+2 BETTER BE A FACTOR.| 
= |(a + 2)(4x — 13)| 
= |4xr — 13|- |a + 2| [GOAL #1 ACCOMPLISHED] 
[GOAL #2: WE WANT TO BOUND ABOVE THE COEFFICIENT |4” — 13| OF |a — al BY A 


CONSTANT. BUT OBVIOUSLY |4x — 13] IS NOT BOUNDED ABOVE BY A CONSTANT FOR 
ALL 2. SO WE NEED TO MAKE A RESTRICTION ON 6: LET’S MAKE SURE THAT 6 IS AT 
MOST 1 (OR 2, OR 15, IT DOES NOT MATTER WHAT POSITIVE NUMBER YOU PICK IN THIS 
EXAMPLE). SO ON THE 6 LINE WRITE: 6 = min{1, }. THIS GUARANTEES THAT 
6 WILL BE 1 OR SMALLER, DEPENDING ON WHAT GETS FILLED IN AFTER THE COMMA.] 

= |4(a + 2 — 2) — 13] - |a + 2| (adding a clever zero) 

= |4(a + 2) — 21|- |x +2| 

< Cc +2)|+ 21) -|x+2| 

(by the triangle inequality |a + b| < |a| + |)|) 
< (4-14 21)- |x + 2| (since |x + 2| < 6 < 1) 
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= 25-|x+2| [GOAL #2 ACCOMPLISHED. | 
[Now GO BACK TO SPECIFYING 6 AT THE BEGINNING OF THE PROOF BY FILLING IN 


WITH: 6 = min{1,¢/B}. THIS MEANS THAT 6 IS THE SMALLER OF 1 AND €/B, AND IN 
PARTICULAR 6 <1 AND 6 < ¢/B.| 

< 25-06 

< 25 - €/25 


= €. 


The final version of the proof of lim, (4° — 5a + 2) = 28 then looks like this: 
£>— 


The function that takes x to 4x? — 5a + 2 is a polynomial function and it is defined 
for all complex numbers. Thus —2 is a limit point of the domain. Let € > 0. Set 6 = 
min{1,¢/25}. Then 6 is a positive real number. Let x be any complex number. Suppose 
that 0 < |x +2| < 6. Then 

|(4a? — 5a +2) — 28] = |4a? — 5a'= 26] 
= |(a + 2)(4a — 13)| 
= |4x — 13] - |x + 2| 
= |4(@ + 2 — 2) — 13] - |w + 2] (adding a clever zero) 
= |4(a + 2) — 21|- |x +2| 
< (|4(a + 2)| + 21) - |x + 2| 
(by the triangle inequality |a + b| < |a| + |)]) 
< (4-1+421)-|xz+ 2| (since |x + 2] < 6 < 1) 
= 25-|x+2| 
<25%0 
< 25+ €/25 


= €. 


The next example has the same type of discovery work with fewer comments. 
Example 4.1.4. lim (4/2*) =, 
Lt 


Proof. The function that takes x to 4/x? is defined for all non-zero complex numbers, so 
—1 is a limit point of the domain. Let «€ > 0. Set 6 = . Then 6 is a positive 
real number. Let « be any complex number that satisfies 0 < |x +1| <6. Then 


4 — 4x? 
ED 


ene | 
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(1—2)(1+72) 


=4 


1-2 


=4 Jl +2] 


2 
GOAL #1 ACCOMPLISHED: SOMETHING TIMES |z — al.] 
[NOW WE WANT THE SOMETHING 4 (43£) TO BE AT MOST SOME CONSTANT. CERTAINLY 


2 


IF WE ALLOW x TO GET CLOSE TO 0, THEN (1 —2x)/x? IS VERY LARGE, SO IN ORDER TO 

FIND AN UPPER BOUND, WE NEED TO MAKE SURE THAT x STAYS AWAY FROM 0. SINCE & 

IS WITHIN 0 OF —1, IN ORDER TO AVOID 0 WE NEED TO MAKE SURE THAT 0 IS STRICTLY 

SMALLER THAN 1. FOR EXAMPLE, MAKE SURE THAT 6 < 0.4. THUS, ON THE 0 LINE 

WRITE: 6 = min{0.4, al 

2—(a#+4+1) 
|x|? 


2 1 
case + a2| (by the triangle inequality) 


=| fin+al (by rewriting 1-2 =2-(#£+1)) 


<4 
= re 


2.4 

|x|? 

= sel + 2| (by rewriting x =x+1-1) 
jz +1-—1/? 


9.6 
< oer! + 2| (by the reverse triangle inequality because 


<4 


|1 + x| (since 6 < 0.4) 


ee ot Sle Sa eS tO, 
so that 1/|x +1-—1|? < 1/0.67) 


[ON THE 5-LINE NOW WRITE: 6 = min{0.4, 0.67¢/9.6}.] 
9.6 

—6 

0.62 


965 


< 


= €. 


And here is another example with 6 already filled in: 


4a? te = = 
8xr-1 2. 


Example 4.1.5. lim 


L221 


Proof. The domain of 40 te consists of all complex numbers different from 7/8, so 27 is a 
limit point of the domain. Let « > 0. Set 6 = min{1,¢/9}. [IT IS SO OBVIOUS THAT THIS 
MINIMUM OF TWO POSITIVE NUMBERS IS POSITIVE THAT WE SKIP THE ASSERTION “THUS 
6 IS A POSITIVE REAL NUMBER.” DO NOT OMIT THE ASSERTION OR THE CHECKING OF 


ITS VERACITY FOR MORE COMPLICATED SPECIFICATIONS OF 6.] Let x be any complex 
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number different from 7/8 such that 0 < |#—27| < 6. [HERE WE MERGED: “LET 2 BE ANY 
COMPLEX NUMBER DIFFERENT FROM i/8. LET x SATISFY 0 < |x — 2%| < 6.” INTO ONE 
SHORTER AND LOGICALLY EQUIVALENT STATEMENT “LET 2 BE ANY COMPLEX NUMBER 
DIFFERENT FROM i/8 SUCH THAT 0 < |x — 2i| < 6.”] Then 


7 -(-2) = 3 +] 
Ax? + x + 162 — 2i 
8x —1 

Ag? 179-21 
8x —1 

(4x? + 8ix + 1)(x — 21) 

8x — 1 

(4x? + 8ix + 1) 
8x —1 


|a — 23| 


(Goal #1 is accomplished: x — a is a factor.) 


Ax? 82 1 
Fa |4a~| + | ae + |x — 2i| (by the triangle inequality) 
|8a — 4| 
< A@ 2+ 2) + Sle = 2+ i+ 1), aii 
|8(a — 2%) + 153] 
— ) 2 = } 
¥ A(|a% — 2%] + 2)° + 8(|x — 2¢| + 2) sa 2i| 


= —8|x — 21] + 15 
(by the triangle and reverse triangle inequalities) 
Bs 4(1+2)?+8(1+2)+1 


= —8 +15 pe 
(since |x — 27] < 6 < 1) 
61 
= 7 le = 24] 
< 96 
< 9/9 


The next example is of a limit is at a non-specific a. 


Example 4.1.6. lim (2? — 2x) = a? — 2a. 


wa 
Proof. Any a is a limit point of the domain of the given polynomial function. Let € > 0. 
Set 6 = min{1,¢/(1+ |2a — 2|)}. Let x satisfy 0 < ja —a| < 6. Then 
|(@? — 22) = (a? = 2a)| = |(2? — a?) — (2x = 2a)| (by algebra) 
= |(x +. a)(x — a) — 2(a — a)| (by algebra) 
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= |(x + a — 2)(a — a)| (by algebra) 

=|£&+a— 2] |x —al 

= |(a — a) + 2a — 2| |x — a| (by adding a clever 0) 

< (|e — a] + [2a — 2|) |x — al 
(by the triangle inequality) 

< (1+ |2a — 2|) |” — a| (since |x — al < 6 <1) 
(1+ [2a =2/)6 
( 


< 
< (14+ |2a — 2))e/(1 + |2a — 2]) 


= €. 


Remark 4.1.7. Note that 6 depends on € and a, which are constants in the problem; 6 is 
not allowed to depend on 2, as the definition goes: 

“for all € > 0 there exists 6 > 0 such that for all x, etc” 
so that x depends on 6, but 6 does not depend on «x. 
By the definition of limits, 6 is supposed to be a positive real number, not a function of x. 
(See also Exercise 4.1.1.) 


Remark 4.1.8. In all cases of rational functions, such as in examples above, Goal #1 is 
to factor x — a from f(x) — L: if x —a is not a factor, check your limit or algebra for any 
mistakes. In the next example, x — a is not a factor, but x — a is. 


Example 4.1.9. lim, V2x —-6=0. 

Poe 
Proof. The domain here is all x > 3. So 3 is a limit point of the domain. Let « > 0. Set 
6 =? /2. Let x > 3 satisfy 0 < |z — 3| < 6. Then 


V2e —6 = V2-Vz—3 
< V2-V6 (because /_ is an increasing function) 


= v2. Yep 


= €. 


Often books consider the last example as a case of a one-sided limit (see definition 
below) since we can only take the x from one side of 3. Our definition handles both-sided 
and one-sided and all sorts of other limits with one simple notation, but we do have a use 
for one-sided limits as well, so we define them next. 
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Definition 4.1.10. Let AC R,aé€R, L € C, and f : A> C a function. Suppose that 
a is a limit point of {x € A: x > a} (resp. of {c € A: x < a}). We say that the right- 
sided (resp. left-sided) limit of f(x) as x approaches a is L if for every real number 
€ > 0 there exists a real number 6 > 0 such that for all x € A, if0 <x—a < 6 (resp. if 
0<a-—«z <6) then |f(x) — L| < «. When this is the case, we write lim,_,,+ f(x) = L 
(resp. limtj.<.= f(a) =) 


With this, Example 4.1.9 can be phrased as lim,_,3+ 2x — 6 = 0, and the proof goes 
as follows: The domain A consists of all x > 3, and 3 is a limit point of AN{x@: x > 3}=A. 
Let « > 0. Set 6 = €?/2. Let x satisfy 0 <x —3< 6. Then 

V22 —-6=V2-Vr—3 
< V2-Vv6 
- vi. JER 


= €. 


Thus, the two proofs are almost identical. Note that lim,_,3;- 2x —6 does not exist 
because 3 is not the limit point of AN {xz € R: a < 3}=90. 


One-sided limits can also be used in contexts where lim,-,, f(a) does not exist. 
Below is one example. 


ga. a Sas 
GoD oes. 


Then 


Example 4.1.11. Let f : R — R be given by f(x) = { 
lites bf (7): = Se Hints fe Se. 


Proof. Let € > 0. Set 6 = min{1,¢/3}. Let x satisfy 0 << «—1< 06. Then 
|f(z) — 5) = |x? +4 — 5] (since x > 1) 
= |e? = 1 
= |x +1)(e-1) 
< |x +1]6 (since x > 1, so x + 1 is positive) 
= |x — 1+ 2|d (by adding a clever 0) 
< (|2 — 1| + 2)6 (by the triangle inequality) 
< (1+2)6 (since O0< e-—1<6<1) 
< 3/3 


= €. 


This proves that lim,_,,+ f(a) = 5. 


170 Chapter 4: Limits of functions 


Set 6 =e. Let x satisfy 0<1—2a2 <6. Then 
| f(z) — (—1)| = |e —2+1| (since x < 1) 
=|x-1| 


a) 


This proves that lim,_,,- f(z) = —3. 


Exercises for Section 4.1 
4.1.1. Below is an attempt at a “proof” that lim,_,3 7? = 9. Explain how the two starred 
steps contribute to at least three mistakes total. 
Let € > 0. Set 6 =* €/|a +3]. Then 
|x? — 9| = |2 — 3|-|a2 +3] 
<* 6x + 3] 


= €. 


4.1.2. Fill in the blanks of the following proof that lim, ,2(? — 3x) = —2. Explain why 
none of the inequalities can be changed into equalities. 


so that 2 is a . Let e > 0. Set 6 = . Let x satisfy 
0 < |x —2| <6. Then 


l(a? = 32) =(—2)|/= la? — 3x + 2| 


= |x—1||xz—2| (because ) 
= |r—2+1| |x—2| (by ) 
< (|e -2)+1) |e-2) (by ) 
<(3+1) |a—2| (because ) 
<4d (because ) 

< 45 (because ) 


= €. 


i) If the domain of this function is R as opposed to C, then |x — 2] < 4 can be shown 
also with the following proof: Since |x — 2| < 6 < 3, then —3 < x — 2 < 3, so that 
—2 <x«-—1 <4, which means that |x — 1] < 4. Say why this argument does not 


work if the domain is C. 
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4.1.3. Determine the following limits and prove them with the epsilon-delta proofs. 


) 1 
ii) lim +. 
zo2 * 
iii) lim, oy 
iv) lim, Vxr+0. 
x 
: x9 
oe 


4.1.4. Rework Example 4.1.3 with choosing 6 to be at most 2 rather than at most 1. 
4.1.5. Let b€ Cand f,g: CC with 

Heat ets M1, regs 
Prove that the limit of f(a) as x approaches 5 is independent of b, but that the limit of 
g(x) as x approaches 5 depends on b. 
4.1.6. Suppose that a is a limit point of {x € A: x > a} and of {x € A: x <a}. Prove 
that lim, _., f(x) = L if and only if lim,_,,+ f(z) = E and lim,_,,- f(x) = L. 
4.1.7. Suppose that a is a limit point of {c € A: x > a} but not of {1 € A: a < a}. 
Prove that lim,_,4 f(x) = L if and only if lim, _,,+ f(x) = L. 
4.1.8. Prove that lim, .,(mz +1) = ma-+l, where m and / are constants. 
4.1.9. Let f : R > R be given by f(x) = te]; Prove that lim,_,o+ f(x) = 1 and that 
lim, ,9- f(z) =—1. 
4.1.10. Find a function f : R > R such that lim,_,9- f(x) = 2 and lim,_,9+ f(x) = —5. 


4.1.11. Find a function f : R > R such that lim,_,9- f(x) = 2, lim,_,9+ f(x) = —5, 
lim, .1- f(z) = 3, and lim,_,;+ f(z) = 0. (Try to define such a function with fewest 


possible words or symbols, but do use full grammatical sentences.) 


4.2 When a number is not a limit 


Recall that lim,-., f(z) = DL means that a is a limit point of the domain of f, and 
that for all real numbers € > 0 there exists a real number 6 > 0 such that for all x in the 
domain of f, if 0 < |x -—a| < 6 then |f(z) — L| < «. [THINK oF lim,_,, f(z) = D as 
STATEMENT P. a BEING A LIMIT POINT OF THE DOMAIN AS STATEMENT Q, AND THE 
EPSILON-DELTA PART AS STATEMENT R. By DEFINITION, P 1S LOGICALLY THE SAME AS 
THE STATEMENT Q and R.| 

Thus if lim,.. f(a) 4 L, then either a is not a limit point of the domain of f or else 
it is not true that for all real numbers € > 0 there exists a real number 6 > 0 such that for 
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all x in the domain of f, if 0 < |x —a| < 6 then | f(x) — L| < «. [THIS SIMPLY SAYS THAT 
not P IS THE SAME AS (not Q) or (not R).| 


In particular, 
lim f(z) 4 L and a is a limit point of the domain of f 
ra 


means that it is not true that for all real numbers ¢€ > 0 there exists a real number 6 > 0 
such that for all 2 in the domain of f, if 0 < |x —a| <6 then |f(x) — L| < e. |THIS Says 
THAT (not P)andQ IS THE SAME AS not R. YOU MAY WANT TO WRITE TRUTH TABLES 
FOR YOURSELF. | 

Negations of compound sentences, such as in the previous paragraph, are typically 
hard to process and to work with in proofs. But by the usual negation rules of compound 
statements (see chart on page 32), we successively rewrite this last negation into a form 
that is easier to handle: 


not (For all real numbers ¢€ > 0 there exists a real number 6 > 0 
such that for all x in the domain of f, if 0 < jr—a| <6 
then | f(x) — L| < «.) 
[NEGATION OF “FOR ALL z OF SOME KIND, PROPERTY P HOLDS” IS “THERE IS SOME z 
OF THAT KIND FOR WHICH P IS FALSE.” HENCE THE FOLLOWING REPHRASING:] 


= There exists a real number € > 0 such that not (there exists a 
real number 6 > 0 such that for all x in the domain of f, if 
0 < |x —a| <6 then |f(x) —L| <«.) 
[NEGATION OF “THERE EXISTS z OF SOME KIND SUCH THAT PROPERTY P HOLDS” IS 
“FOR ALL z OF THAT KIND, P IS FALSE.” HENCE THE FOLLOWING REPHRASING:| 


= There exists a real number ¢€ > 0 such that for all real numbers 
6 >0, not (for all x in the domain of f, if 0 < |x—a| <6 
then | f(x) — L| < «.) 
[NEGATION OF “FOR ALL Zz OF SOME KIND, PROPERTY P HOLDS” IS “THERE IS SOME z 
OF THAT KIND FOR WHICH P IS FALSE.” HENCE THE FOLLOWING REPHRASING: | 
= There exists a real number ¢€ > 0 such that for all real numbers 
6 > 0, there exists x in the domain of f such that not (if 
0 < |x —a| <6 then |f(z) —L| <«.) 
[NEGATION OF “IF P THEN Q” IS “P AND NOT Q.” HENCE THE FOLLOWING REPHRAS- 
ING: | 


= There exists a real number ¢« > 0 such that for all real numbers 
6 > 0, there exists x in the domain of f such that 0 < 
|z — a| <6 and not (|f(x) — L| < «.) 
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= There exists a real number ¢« > 0 such that for all real numbers 
6 > 0, there exists x in the domain of f such that 0 < 
| —a| <6 and |f(x) — L| >. 


In summary, we just proved that: 
Theorem 4.2.1. If a is a limit point of the domain of f, then lim,-,, f(x) # L means 


that there exists a real number € > 0 such that for all real numbers 6 > 0, there exists x 
in the domain of f such that 0 < |x —a| < 6 and |f(x) — L| >. 


Example 4.2.2. The limit of iat as x approaches 0 does not exist. In other words, for all 


complex numbers L, limz_+o9 il ae oe 


The domain of the function that takes x to ial 


7] is the set of all non-zero complex 


x 
> [a] 
angle as x. Thus the image of this function is the unit circle in C. Note that it is possible 


numbers. For each non-zero x is a complex number of length 1 and with the same 
to take two non-zero x very close to 0 but at different angles so that their images on the 
unit circle are far apart. This is a geometric reasoning why the limit cannot exist. Next 
we give an epsilon-delta proof. 


Proof. The domain of the function that takes x to al is the set of all non-zero complex 


numbers, so that 0 is a limit point of the domain. [THUS IF THE LIMIT IS NOT L, THEN 
IT MUST BE THE EPSILON-DELTA CONDITION THAT FAILS.] Set ¢ = 1. Let 6 > 0 be an 
arbitrary positive number. Let « = —d/2 if Re(L) > 0, and let x = 6/2 otherwise. Then 
0 < |a| = |x —0| < 0. If Re(L) > 0, then 


Aa —6/2 
re(= 1) =Re(—F -1) = Sie) St, 
Iz | — 6/2| 
so that lial — L| > 1 =e, and if Re(L) < 0, then 


re (= - 1) =Re (2-1) = Reh) 1, 


so that again | — L| > 1 =e. This proves the claim of the example. 
|ac| 


Example 4.2.3. For all LD € C, lim,-,2 —. Ly, 
A geometric reason for the non-existence of this limit is that as x gets closer to 2 


a 
xr—2 


(but not equal to 2), the size of gets larger and larger. 


Proof. Set ¢ = 1. Let 6 > 0 be an arbitrary positive number. Set 6’ = min{d,1/(|Z| + 1)}. 
Let « = 24 6'/2. Then 0 < |x — 2| < 6’ < 6, and 
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20 
> Pl |L| (by the reverse triangle inequality) 
> 2(|L| +1) — |L| (since 6’ < 1/(|Z| + 1)) 
=1 


= €. 


Example 4.2.4. For f : R > R given by the graph below, lim, f(x) does not exist because 
«> 


of the jump in the function at 2. 


Here is an epsilon-delta proof. Say that the limit exists. Call it L. Set «€ = i. Let 6 
be an arbitrary positive number. If L > 3, set ¢ = 2+ min {4, gh. and if L < 3, set 
xz =2-—min{f, Sy. In either case, 


Lo ) 
0 < je 2] =min{ t s}s3<6 
bie Pao 


5, by our poe x = 2+ minf{j, 2h. so that f(x) = 1+ min{4,$} <1+4 
whence | f(x) — L] > 4 =e. Similarly, if L < 3, by our choice x = 2 — min{+, 3}, so that 
f(z) =2—min{Z, $} > 2-—4=1+4 3, whence |f(x) — L| > } =e. Thus no L works, so 
the limit of f(x) as x approaches 2 does not exist. 


Exercises for Section 4.2 
4.2.1. Prove that lim,_,3(32 — 4) 4 —3. 
4,2.2. Prove that lim, ,;(2? + 4) # —5. 


4.2.3. Prove that lim,_,_3 does not exist. 


5 


Section 4.3: More on the definition of a limit 175 


4.2.4. Prove that for all a € R, lim,_,, f(x) does not exist, where f : R > R is defined by 
fers 1, if x is rational; 

~ | 0, if x is not rational. 
4.2.5. Prove that lim,,0(./z — /—2) does not exist. (Hint: The reason is different from 
the reasons in other examples in this section.) 


4.3 More on the definition of a limit 


The purpose of this section is to show that even small changes in the definition of 
limits affect the meaning significantly. A lesson to be learned is that it is important to 
remember any formal statement precisely. 

Here is a restatement of Definition 4.1.1 for lim,., f(x) = L when a is a limit point 
of the domain A of f: 


Ve>0O Ad6>0 Vee A if0< |x -—a| <6 then |f(z) -—L| <e. (4.3.1) 


Example 4.3.2. Suppose that in Statement (4.3.1) we switch the order of the first two 
quantifiers: 


6>0 Ve>0 Vee A if0< |x—-a| <6 then |f(4)-L| <e. 


Let f :C > C be the function given by f(x) = x. By Exercise 4.1.8, lim, f(x) = a, but 
this f does not satisfy the modified definition above because no matter what 6 is taken, the 
conditional fails for any « < 6/2. Thus this modification of the definition of limits changes 


the meaning. 


Example 4.3.3. Suppose that in Statement (4.3.1) we switch the order of the second and 
third quantifiers: 


Ve>0O VaeA Ad>0 if0< |x—a| <6 then |f(xz)-—L| <e. 


Every function f : A > C satisfies this statement because after € and x are fixed we may 
set 6 to be |x — a|/2 if x # a and 1 otherwise. With this, the antecedent 0 < |x —a| <6 
is false, which means that the conditional is true, so that f satisfies the statement. In 
Section 4.2 we saw that limits need not exist, which means that this modification changes 
the meaning of the definition of limits. 
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Example 4.3.4. Suppose that in Statement (4.3.1) we replace the first V with 3: 


de>O Ad>0 VaeEA if0< |x—-al| <6 then |f(z) -—L| <e. 


Then whenever lim,., f(z) = L, the modified statement is satisfied with any complex 
number L’ in place of L. Namely, set € = 1+ |L’—L]. By the definition of lim,;_,, f(x) = L 
there exists 6 > 0 such that for all x € A, if 0 < |x —a| <6 then |f(a) — L| < 1. Then 
f(z) -L'| =|f@)-L+L-L'| 
<|f(x) — L|+|L—L’| (by the triangle inequality) 
<1+|L-L'| 
=e. 
This means that with this modification of the definition of limits, limits would not be 
unique (but they are by Theorem 4.4.1). 


Furthermore, the function Tz]? for which we proved in Example 4.2.2 that no limit 
exists at a = 0, satisfies this last modified statement for any L. 


Example 4.3.5. Suppose that in Statement (4.3.1) we replace the first 4 with VY: 
Ve>0 Vd>0 Vee A if0< |x—-a| <6 then |f(x) -—L| <e. 


We claim that the only functions f that satisfy this statement are those that are constant on 
A\{a}. Namely, suppose that b,c € A\{a}. Then for any « > 0, whenever 6 > |b—a|+|c—al, 
we have that | f(b) — f(c)| = |f(b) -L+L-—- f(c)| < |f(b) — L| +|L— f(o)| < 2e. Then by 
Theorem 2.10.4, | f(b) — f(c)| =0, so that f(b) — f(c) = 0. Thus f is constant as claimed. 
But we know by the first section in this chapter many non-constant functions have limits 
as well, which means that this modification of the definition of limits changes the meaning. 


Example 4.3.6. Suppose that in Statement (4.3.1) we replace the conditional with the 


conjunction: 


Ve>O 3d>0 Vee A O< |x-a| <6 and |f(x)—-—L| <e. 


This modification fails for every function f that is not equal to the constant L on A \ {a}. 
Namely, for the condition 0 < |x —a| < 6 to hold, necessarily 6 must be so large so 
that A C B(a,6); and for |f(x) — L| < € to hold for all x € A \ {a} and all € > 0, by 
Theorem 2.10.4, f(a) = L. Thus again, this modification of the definition of limits changes 


the meaning. 


So far we have examined modifications of Statement (4.3.1) in which we switched 
the order of quantifiers, we switched a quantifier, or we changed the conditional into a 
conjunction. In all cases the modification resulted in a different meaning. We can modify 
Statement (4.3.1) in many other ways and get even further meanings. 
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Example 4.3.7. Another possible modification of Statement (4.3.1) is to omit a quantifier, 
such as the quantifier for x: 


Ve>O Ad>0 if0< |x—-a| <6 then |f(z) -—L|<e. 


It is common to treat the occurrence of x in the statement as tacitly assuming “Vx € A”. 
However, this tacit assumption is not fine! Namely, the negation of this modification is 


e>0 Vd6>0 0< |x-a| <6 and |f(xz) -—Ll Se. 


The lack of the quantifier on x would then again tacitly signal the universal quantifier, 
whereas the correct negation calls for the existential quantifier. In short, we cannot omit 
a quantifier. 

We finish the section with a modification of Statement (4.3.1) that does not change 
the meaning: we replace the last two occurrences of “<” in the statement with “<”. 


Theorem 4.3.8. Let f :A-— C be a function and let a € C be a limit point of A. Then 
lim, +a f(a) = L if and only if 


Ve>0O 3Ad>0 Vee A if0< |x—a| <6 then |f(z)-—L| <e. 


Proof. First suppose that lim,, f(2) = L. We will prove that the modified statement 
also holds. Let « > 0. By assumption there exists 6’ > 0 such that for all x € A, if 
0 < |x —a| < 6’ then |f(x) — L| < «. Set 6 = 6’/2. Let x € A such that 0 < |x —a| <6. 
Then 0 < |x —a] < 0’, so that |f(a) — L| < €, and hence | f(a) — L| < «. Thus the modified 
statement holds. 

Conversely, suppose that the modified statement holds. We will prove that 
lim, +a f(z) = L. Let « > 0. By assumption there exists 6 > 0 such that for all x € A, 
if 0 < |x —a| < 6 then |f(x) — L| < €/2. Let x € A such that 0 < |x —a| < 6. Then 
0 < |x —a| <6, so that | f(a) — L| < «/2 < e. Thus lim,_,, f(x) = L. 


Exercises for Section 4.3 


4.3.1. Prove that the only functions f : C — C that satisfy the statement in Example 4.3.2 
at all a in the domain are the constant functions. Find a non-constant function f : [0,1] U 
[2,3] > C that satisfies that statement at all a in the domain. 
4.3.2. Examples 4.3.2 and 4.3.3 modified Statement (4.3.1) in the ordering of the quanti- 
fiers. Similarly show that the remaining orderings (all on the list below) also change the 
meaning. 

i) 4d6>0 Vr EA Ve>0O if0<|x—-—a| <6 then |f(x) —L| <e. 

ii) V2E A Ve>O0 A6>0 if0<|x—a| <6 then |f(x) —L| <e. 
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ii) Vx € A Ad>0 Ve>O if0<|xz-—a| <6 then |f(x) -—L| <e. 
4.3.3. Assume that a is a limit point of the domain A of f. 
i) Prove that lima f(x) = L if and only if 


Ve>O0 Ad>0 VeeA if0< |x —-—a| <6 then |f(xz) -—L| <e. 


ii) Prove that lim,_,, f(x) = L if and only if 


Ve>0O0 3d>0 Vere A if0< |x—a| <6 then |f(x)-—L| <e. 


4.4 Limit theorems 


While epsilon-delta proofs are a reliable method for proving limits, they do not help 
in deciding what a limit may be. In this section we prove theorems that will efficiently 
establish the limits for many functions. The proofs of these theorems require the epsilon- 
delta machinery — as this is the definition of limits, but subsequent applications of these 
theorems allow us to omit the time-consuming epsilon-delta proofs. 


Theorem 4.4.1. If a limit exists, it is unique. 


Proof. Suppose that both L, and Lz are limits of f(x) as x approaches a. 

First I give a FALSE proof: by assumption, L, = lim, 5, f(x) = Le, so that by 
transitivity of equality, Ly = Do. Xx 
What makes this proof false is that the equal sign in “lim,_., f(z) = L” signifies, at this 
point, not numerical equality, but that a is a limit point of the domain of f and that for 
every € > 0 there exists 6 > 0 such that for all x in the domain of f, if 0 < |x—a| <6 then 
|f(x) — L| < «. Thus, we need a proof that uses this definition. 

Real proof of Theorem 4.4.1: Suppose that LD, and Ly are both limits. Let € be an 
arbitrary positive number. Then €/2 is also positive, and by the definition of limits, for 
each i = 1,2, there exists 6; > 0 such that for all x in the domain of f, if 0 < |x —a| < 6;, 
then | f(a) — L;| < €/2. Set 6 = min{d,, 62}. Then 6 is a positive real number. Let x in the 
domain of f satisfy 0 < |x —a| < 6. Since a is a limit point of the domain, such z exists. 
Then 


|Z, — Lo| = |Li — f(x) + f(x) — Le| (by adding a clever 0) 
< |Z, — f(x)|+|f(x) — Le| (by the triangle inequality) 
< 2€/2 (since 6 < 64,62) 


=, 
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which says that |Z, — L2| < ¢. Since € was arbitrary, an application of Theorem 2.10.4 
gives that |L1 — L2| = 0, so that Ly — Lz = 0, and hence that Ly = 2. 


(If in the definition of lim,_,, f(a) we did not require that a be a limit point of the 
domain of f, then x as in the proof above would not exist for small 6, so any complex 
number L would vacuously satisfy the definition of limits. We would thus not be able 
to guarantee that limits are unique. Perhaps the definition of limits is technical, but the 
technicalities are there for a good reason.) 


Theorem 4.4.2. Let a be a limit point of the domain of a function f with lim,-., f(x) = 
L. Suppose that L £0. Then there exists 6 > 0 such that for all x in the domain of f, if 
0 < |x —a| <6, then |f(x)| > |L|/2. 

In particular, there exists 6 > 0 such that for all x in the domain of f, if0 < |x—a| < 
6, then f(x) £0. 


Proof. Since |L£|/2 > 0, there exists 6 > 0 such that for all x in the domain of f, if 
0 < |x —a| < 6, then |f(a) — L| < |L|/2. Hence by the reverse triangle inequality, for the 
same x, |L|/2 > |f(x) — L| > |L| —|f(x)|, so that by adding | f(x) + |LZ|/2 to both sides we 
get that |f(x)| > |£|/2. In particular f(x) 4 0. 


The following theorem is very important, so study it carefully. 


Theorem 4.4.3. Let A be the domain of f and g, and let a be a limit point of A, and let 
c€C. Suppose that lim,_., f(z) and lim, _.,4 g(x) both exist. Then 

(1) (Constant rule) lim C=c: 

(2) (Linear rule) lim x =(. 

(3) (Scalar rule) jim cf(x) = c lim Fae): 

(4) (Sum/difference rule) lim (f(x) #9(¢)) = lim fa eee lim g(a). 

(5) (Product rule) lim (f(2) - g(x)) = lim f(x) - lim g(x). 
) lim f(z) 


_ «@>a 


(6) (Quotient rule) If jim g(x) #0, then jim a = lim Et 


Proof. Both (1) and (2) are proved in Exercise 4.1.8. 

Set LE = lim,., f(x) and K = lim,_,, 9(z). 

(3) Let « > 0. Then €/(\c] + 1) is a positive number. (Note that we did not divide 
by 0.) Since LZ = lim,_,, f(x), there exists 6 > 0 such that for all x € A, 0 < |x -—a| <6 
implies that | f(a) — L| < «/(\c| + 1). Hence for the same z, 


lef(x) — cL] = lel: |f(a) — £] < lel -€/(el +1) <«, 


which proves that lim, _,,cf(x) = cL =clim,_4, f(z). 
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(4) Let « > 0. Since L = lim,_,, f(x), there exists 6, > 0 such that for all x € A, 
if 0 < |x — a] < 64, then | f(x) — L| < €/2. Similarly, since K = lim,-,, g(x), there exists 
62 > 0 such that for all « € A, if 0 < |x —a| < de, then |g(x) — L| < €/2. 


Set 6 = min{d1, 42}. Then 6 is a positive number. Let x € A satisfy 0 < |x —a| < 6. 
Then (4) follows from: 


\(F(@) # g(@)) — (2 B)| = |(f (2) — £) + (g(a) — K)] (by algebra) 
< |f(v) — L| + |g(x) — K| (by the triangle inequality) 
< €/2 + 6/2 (because 0 < |x — al < 6 < 64, 62) 
=e, 
(5) Let € > 0. Then min{e/(2|L| +2), 1}, min{e/(2|K|+ 1), 1} are positive numbers. 
Since L = lim,_,, f(x), there exists 6; > 0 such that for all x € A, if 0 < |a—a| < 6,, then 


|f(x) — L| < min{e/(2|K|+1),1}. Similarly, since K = lim,-,, g(x), there exists d2 > 0 
such that for all x € A, if 0 < |x —a| < de, then |g(x) — L| < €/(2|L| + 2). 
Set 6 = min{d1, 62}. Then 6 is a positive number. Let x € A satisfy 0 < |x —a| < 6. 
Then by the triangle inequality, 
\f(w)| = fw) - L+ 2] < |f(w)-£)+|L)< 14+ |Z}, 
and so 
If(@)- g(a) —L- BK] =|f(a)- g(a) — f@)K + f(a) K — L- BY 
(by adding a clever zero) 
f(x) - g(a) — f(#)K| + |fl@)K — L- B| 
(by the triangle inequality) 
= |f(@)|- lg) — K+ [f(a eS L|- || (by factoring) 
€ 


1+ |L}) - Kk 6 < 04,6 


<¢/2+6/2 


= €. 


IA 


A 


(6) Let € > 0. Since K #0, by Theorem 4.4.2, there exists dg > 0 such that for all 
x€A, if0< |x —al < do, then |g(x)| > |K|/2. 

The numbers |K|e/4, |K|?¢/(4|Z| + 1) are positive numbers. Since L = limz-+a f(z), 
there exists 6; > 0 such that for all x € A, if 0 < |x —a| < 64, then | f(x) — L| < |Ke/4. 
Similarly, since K = lim,_,q g(x), there exists 62 > 0 such that for all x € A, if 0 < |x—a| < 
62, then |g(x) — L| < |K|?¢/(4|L| + 1). 

Set 6 = min{do, 61, 62}. Then 6 is a positive number. Let x € A satisfy 0 < |x—a| < 6. 
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Then 
fiz) L|_ |f(e)K —Log(z) aleebra 
fray el~ [oma | OY alee 
f(®)K —-LK+LK — Lg(z) 
= Kg) (by adding a clever zero) 
f(a)K —-LK LK — Lg(x) 
< 
~ | Kg(x) Kg() 


(by the triangle inequality) 
=e 2) |e 
g(x) JA]} | g(x) 
JKle 2 | |L| |K/Pe 
4 |K|" |K/4|L[ +1 [K 
<¢€/2+6€/2 


= €. 


(by factoring) 


(since 6 < do, 01, 62) 


This proves (6) and thus the theorem. 


Theorem 4.4.4. (Power rule for limits) Let n be a positive integer. If limz-._ f(x) = L, 
then linttya¢ f(a) SL 


Proof. The case n = 1 is the assumption. Suppose that we know the result for n —1. Then 


lim f(x)" = lim (f 
=-lim ( 7 
= L"~'. L (by induction assumption) 


=". 


lead lim (f(x)) (by the product rule) 


So the result holds for n, and we are done by mathematical induction. 


Theorem 4.4.5. (Polynomial function rule for limits) Let f be a polynomial func- 
tion. Then for all complex (or real) a, limy-+a f(x) = f(a). 


Proof. Because f is a polynomial function, it can be written as 
f(z) = co tex + cou? +0302 +--+ + en2" 


for some non-negative integer n and some constants cg,C1,...,Cn. By the linear rule, 
limz-+a 2 = a. Hence by the power rule, for all i = 1,...,n, lim,4,2° = a’. By the 
i 


constant rule, lim;-,,¢; = c;, so that by the product rule limz-,, cj2’ = cja*. Hence by 


repeating the sum rule, 


] — |j 2 arine fh 
lim f(x) = lim (co + cit + eg + +++ + nx") 
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= cy +e1a + Coa? +--- +e,a” 


= f(a). 


Theorem 4.4.6. (Rational function rule for limits) Let f be a rational function. 
Then for all complex (or real) a in the domain of f, limz-+a f(x) = f(a). 


Proof. Let a be in the domain of f. Write f(x) = g(x)/h(x) for some polynomial functions 
g,h such that h(a) 4 0. By Theorem 2.4.15, the domain of f is the set of all except finitely 
many numbers, so that in particular a is a limit point of the domain. By the polynomial 
function rule for limits, lim, ., g(x) = g(a) and lim, ,, h(x) = h(a) # 0. Thus by the 
quotient rule, lim,., f(z) = g(a)/h(a) = f(a). 


Theorem 4.4.7. (Absolute value rule for limits) For all a € C, lim,-,, |x| = |al. 


Proof. This function is defined for all complex numbers, and so every a € C is a limit point 
of the domain. Let « > 0. Set 6 = «. Then for all x € C, by the reverse triangle inequality, 


||z| — |al| < |x -—a|<d=e. 


Theorem 4.4.8. (Real and imaginary parts of limits) Let f : 4 > C be a func- 
tion, let a be a limit point of A, and let L € C. Then lim,,, f(x) = L if and only if 
lim;+a Re f(z) = Re L and lim,_,, Im f(x) = Im L. 


Proof. First suppose that lim,+, f(x) = L. Let « > 0. By assumption there exists 6 > 0 
such that for all x € A, 0 < |x —a| <6 implies that | f(a) — L| < «. Then for the same z, 


| Re f(x) — Re L| = | Re(f(x) — L)| < fw) -L] <«, 


and similarly |Imf(#) — ImZ| < e€ which proves that lim,,,Ref(rz) = ReLZ and 
lim,;+aIm f(#) = Im L. 

Now suppose that lim,_,, Re f(z) = Re L and lim,_,, Im f(x) = Im L. By the scalar 
and sum rules in Theorem 4.4.3, and by the definition of real and imaginary parts, 


lita fe) = lim (Re f(x) +iIm f(z)) 


ra 


lim Re f(x) + 7 lim Im f(z) 
=ReL+iImL 
=, 
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Theorem 4.4.9. (The composite function theorem) Let h be the composition of 
functions h = go f. Suppose that a is a limit point of the domain of go f, that lim Fey a 
and that lim g(x) = g(L). Then lim h(x) = lim(go f)(x) = g(L). 

col @z—a ma 


Proof. Let € > 0. Since im g(x) = g(L), there exists 6; > 0 such that for all x in the 
domain of g, if 0 < |jr—a| < 6; then |g(x)—g(L)| < €. Since jim f(x) = L, there exists 6 > 0 
such that for all x in the domain of f, if 0 < |x —a| < 6 then | f(a) — L| < 6,. Thus for the 
same 6, if x is in the domain of h and 0 < |a—a| < 6, then |h(x)—g(L)| = |g(f(x))—g(L)| < € 
because | f(x) — L| < 6y. 


Perhaps the hypotheses on g in the theorem above seem overly restrictive, and you 


think that the limit of g(a) as x approaches L need not be g(L) but an arbitrary kK? 
Consider the following example which shows that lim;-.4 g(f(x)) then need not be kK. Let 
_ _ J 3, ia #5; 
SD ee © otherwise. ’ 
lim f(x) =5, lim g(x) =3, and limo fir))=— 7% 
«wL—> 


La ra 


Then 


Theorem 4.4.10. Suppose that f,g:A— R, that a is a limit point of A, that lim,., f(x) 
and lim;-4a g(x) both exist, and that for all x € A, f(x) < g(x). Then 


lim f(x) < lim g(2). 


ra 
Proof. Let L = lim fie} e= jim g(x). Let € > 0. By assumptions there exists 6 > 0 
such that for all x € A, if 0 < |x —a| < 6, then |f(x) — L|,|g(x) — K| < €/2. Then for the 
same £, 

K -L=K — g(x) +9(2) — f(a) + fa) -L > K-g(a) + f(@) -L 
> -|K — g(x) + f(a) - 1 
> —|K — g(x)| — |f(x) — L| (by the triangle inequality) 
> -—€/2 —€/2 = —«. 


x 


x 


Since this is true for all « > 0, by Theorem 2.10.4, K > L, as desired. 


Theorem 4.4.11. (The squeeze theorem) Suppose that f,g,h: A — R, that a is a 
limit point of A, and that for all x € A \ {a}, f(x) < g(a) < h(x). If lim,., f(x) and 
lim;+a h(x) both exist and are equal, then lim,-4q g(x) exists as well and 


lim f(z) = lim g(x) = tim h(a). 


ra wa 


Proof. [IF WE KNEW THAT lim,4q9(”) EXISTED, THEN BY THE PREVIOUS THEOREM, 
lim; +a f(z) < limg4.9(z) < limg4, h(x) = limz4, f(z) WOULD GIVE THAT THE THREE 
LIMITS ARE EQUAL. BUT WE HAVE YET TO PROVE THAT lim,-,¢ g(“) EXISTS. | 
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Let L = lim, +, f(z) = lim,,,h(x). Let « > 0. Since lim, ,, f(z) = L, there 
exists 6; > 0 such that for all x, if 0 < |x —a|] < 6; then |f(x) — L| < e. Similarly, 
since lim, +_ h(a) = L, there exists 62 > 0 such that for all x, if 0 < |x —a| < 62 then 
|h(a) — L| < «. Now set 6 = min{6,, 62}. Let x satisfy 0 < |x —a| < 6. Then 


-e< f@)-Lseg@)- Ls hile) -L <e, 


where the first inequality holds because 6 < 6;, and the last inequality holds because 6 < dg. 
Hence —e < g(x) — L < €, which says that |g(x) — L| < €, so that lim,_., g(x) = L. 


Exercises for Section 4.4 


4.4.1. Determine the following limits by invoking appropriate results: 
i) lim (x? — 4a — 27), lim(x? + 5). 
r—2 xr—2 


3 A» _ 97\3 
ii) lim ee) 
a2 (22+ 5)? 
|x? — 4r — 27| 


4.4.2. Assume that a is a limit point of the intersection of the domains of complex-valued 
functions f and g. Let lim f(x) = Land jim g(x) = K. Prove that lim (3 f(x)” —Ag(x)) = 
3L? — 4K. 

4.4.3. Prove that lim x” =a” for any non-zero complex number a and any integer n. 


4.4.4. The following information is known about functions f and g: 


a fla) tim f(x) g(a) tim ge) 
0 1 2 6 4 

1 —1 0 5 5 

2 not defined 3 6 —6 

3 4 4 not defined 5 

4 5 1 4 rf 


For each part below provide the limit if there is enough information, and justify. (Careful, 
the answers to the last two parts are different.) 


i) lim (f(@) ~ g(e)). 
lim (f(@) + 9(@)). 
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m (go f)(x). 


li 
zl 


vii) lim (g 0 f)(2). 


li 
rd 


vi) 
4.4.5. By Example 4.2.2 we know that im al does not exist. 

i) Prove that iim (1- a) does not exist. 

ii) Prove that iim (ay + (1- iW) exists. 

iii) Do the previous parts contradict the sum rule for limits? Justify. 
4.4.6. Find functions f,g such that lim,-.4(f(x)g(x)) exists but such that lim,-,, f(z) 
and lim, 4 g(x) do not exist. Does this contradict the product rule? 
4.4.7. Prove that if lim,, f(x) = L, then lim,,,|f(x)| = |Z]. Give an example of a 
function such that lim,_,, |f(ax)| = |Z| and lim,_,, f(x) does not exist. 
4.4.8. Let AC BCC, let a be a limit point of A, and let f: A> C,g: B—C. Suppose 


that for all z € A\{x}, g(x) = f(x). Prove that if lim;-,. g(x) = D, then lim;-,, f(x) = L. 
In particular, if a € R is a limit point of A = BOR and if lim,_,, g(x) = L, then the 


restriction of g to A has the same limit point at a. 


4.4.9. Let ACC, let f,g: A—-C, and let a be a limit point of A. Suppose that for all 
x €A\ {a}, f(x) = g(x). Prove that lim,_,, f(x) exists if and only if lim,_,, g(x) exists, 
and if they both exist, then the two limits are equal. 


4.4.10. Let f,g : IR — R be defined by 
re={5 ifxEeQ te) ={ 4 ie 20; 


otherwise, 0, otherwise. 


i) Prove that lim f(@}=—0. 
xL— 
ii) Prove that lim, Clea me 
xu— 
iii) Prove that lim (g o f)(x) does not exist. 
x“ 
) 


iv) Comment on the relevance of Theorem 4.4.9 in this example. 


*4.4.11. Let a,L,M be arbitrary real numbers. Construct functions F,G : R — R such 
that lim,., F(z) = L, lim,;,,G(z) = M, but lim,_,,(G o F)(x) #4 M. Comment on 
Theorem 4.4.9. (Hint: modify f and g from the previous exercise.) 


4.4.12. (Due to Jonathan Hoseana, “On zero-over-zero form limits of special type”, The 
College Mathematics Journal 49 (2018), page 219.) Let f : [a,b] > R be a continuous 
function. Let A be a subset of C, let a € C be a limit point of A, and let f,..., f,,g be 
complex-valued functions defined on A such that for all k = 1,...,n, limyya f(x) = 1 and 


lim; +a 2 ee) : exists. Prove that 


fila) fo(x)---fe(@)-1_ way, fala) 


lim = lim = 
ra g(x) <a g(x) 
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4.5 Infinite limits (for real-valued functions) 


When the codomain of a function is a subset of an ordered field such as R, the values 
of a function may grow larger and larger with no upper bound, or more and more negative 
with no lower bound. In that case we may want to declare limit to be oo or —oo. Naturally 
both the definition and how we operate with infinite limits requires different handling. 


Definition 4.5.1. Let ACC, f: A—R a function, anda € C. Suppose that a is a limit 
point of A (and not necessarily in A). 

We say that the limit of f(x) as x approaches a is oo if for every real number 
M > 0 there exists a real number 6 > 0 such that for all x € A, if 0 < |x —a| < 6 then 
f(z) > M. We write this as lim f(x) = lim,4 f(x) = co. Similarly we say that the 
limit of f(x) as x approaches a is —oo if for every real number M < 0 there exists a 
real number 6 > 0 such that for all x € A, if 0 < |x —a| <6 then f(x) < M. We write this 
as lim Fle) = ling f (= Sees 

: "The limit of f(x) as x approaches a from the right is —oco ifa is a limit point 

of {x € A: x >a}, and if for every real number M < 0 there exists a real number 6 > 0 
such that for all x € A, if0 <x—a< 6 then f(x) < M. This is written as lim, f(x) 
= lim,_,q+ f(x) = —O. _ 

It is left to the reader to spell out the definitions of the following: 


lim f(z) =o0, lim f(z)=c, lim f(x) = —o. 
zat L—a~ L—a- 


Note that we cannot use epsilon-delta proofs: no real numbers are within «€ of infinity. 
So instead we approximate infinity with huge numbers. In fact, infinity stands for that thing 
which is comparable to and larger than any real number. Thus for all M we can find x 
near a with f(x) > M is simply saying that we can take f(x) arbitrarily large, which is 
more succinctly expressed as saying that f(x) goes to oo. (As far as many applications are 
concerned, a real number larger than the number of atoms in the universe is as close to 
infinity as realistically possible, but for proofs, the number of atoms in the universe is not 
large enough.) 


Example 4.5.2. lim, ae = 0. 


Proof. 0 is a limit point of the domain (in the field of real or complex numbers) of the 
function that takes x to 1/|xz|?. Let M be a positive real number. Set 6 = 1/VM. Let x 
satisfy 0 < |a — 0| < 6, i-e., let x satisfy 0 < |x| < 6. Then 
ci 
———s > es 
jzj2~ 82 ( 
=M. 


because 0 < |x| < 6) 
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Example 4.5.3. lim,_,5+ <3¢3< = 00. 
Proof. Certainly 5 is the limit point of the domain of the given function. Let M > 0. Set 
6 = min{1, wm}: Let x satisfy 0< «—5 <0. Then 


xr+2 2 o+2 
w?—25° 2-25 


(because x > 5 and x? — 25 > 0) 


7 i 
= : by algeb 
(x + 5) pap ee za) 
fi 
>: (because 0 < x—-5<d6<1,s0o0<2+5< 11) 
1l x-5 
> i Lib 0< 5 <0) 
Tg (because i 
7 1 
2 je (because 6 < ~5,, so 1/6 > 1/(=5,)) 
lim 
=i: 


Example 4.5.4. lim, _,9+ 4 =o. 


Proof. Let M > 0. Set 6=1/M. Then for all x withO<«-0<6,4>1/6=M. 


Example 4.5.5. lim,_,9- 4 = =oo. 


Proof. Let M < 0. Set 6 = —1/M. Then 6 is a positive number. Then for all x with 
0 < 0-2 < 0, and so by compatibility with multiplication by the positive real number 


and so by compatibility of order with addition of 4 ae 


1 1 
eo 
x ) 


Example 4.5.6. We conclude that lim,_,9 4 cannot be a real number, and it cannot be 
either oo or —oo. Thus lim,_,o 4 does not exist. 


The following theorem is straightforward to prove, and it is left to the exercises. 
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Theorem 4.5.7. Let f,g,h : A — R, and let a be a limit point of A. Suppose that 


lim, +a f(z) = D ER, limz 4, g(x) = 00, limz_,, h(x) = —oo. Then 
(1) (Scalar rule) For any c € R, 
oO, ire =; —oo, ifc>O0; 
lim (cg(a)) = —oo, ifc<0; lim(ch(z)) = po. ° ite 0; 
ea 0, ife=0; ~" 0, ife=0. 
(2) (Sum/difference rule) 
lin (2) + g(e)) = too, 
lim (f(x) + h(x)) = Foo, 
lim (g(2) — h(a)) = 00, 
lim (h(x) — g(@)) = —00, 


but we do not have enough information to determine whether the limit of g(x) + h(x) as x 
approaches a exists. 


(3) (Product rule) 
OO, if L > 0; 
lim (f(a) - g(@)) = { —oo, ifL <0; 


lim (f(x) - h(x)) = 4 if L > 0; 


La OO, if L < 0; 
lim (g(a) - h(a)) = —oo. 


We do not have enough information to determine the existence (or value) of limz-4a(f (x) - 
g(x)) and of limy .a(f(x) -h(x)) in case L = 0. 
(4) (Quotient rule) 


lim —~ = 0, 

ra g(x) 

lim f(z) = 0, 

ra h(a) 

in g(x) ee lee er if L > 0; 
za f(x) = —Ow, OP de 0; 
5 h(x) _f{ -o0, if L >0; 
ra f(z) = OO, ifL< 0. 

We do not have enough information to determine the existence (or value) of Arnau le(e) 


and of Mia te) in case L = 0. We also do not have enough information to determine 


the existence (or value) of Bieta t) 
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Example 4.5.8. Define g,h1, h2,h3,ha : R \ {0} > R by g(x) = 1/2x?, hi(x) = —-1/2?, 
ho(x) = 17 — 1/2”, h3(x) = —1/x? — 1/2+, ha(x) = —1/x2? — 1/23. We have seen that 
lim;—+0 g(x) = oo, and similarly that limz.9hi(x) = limz4o he(x) = limzg4o9h3(x) = 


lim, .9 ha(x) = —oo. However, 


lim (g(x) + ha(2)) = lim 0 = 0, 


xz—0 


lim (g(a) + ho(x)) = lim 17 = 17, 
x—0 x—0 


lim (g(x) + ha(2)) = lim(—1/s") = —00, 
but lim, .0(g(x) + ha(x)) = lim,_,9(—1/x?) does not exist. This justifies the “not enough 
information” line in the sum/difference rule in Theorem 4.5.7. 
Other “not enough information” lines are left for the exercises. 
Distinguish between the scalar and the product rules: when lim,_,, f(x) = 0, if f is 
a constant function, then lim,.a(f(x)g(x)) = 0, but if f is not a constant function, then 


we do not have enough information for lim, .,(f(x)g(x)) = 0. 


Exercises for Section 4.5 
4.5.1. Give definitions for the following limits. 
1) ding -4 ef (ey SCe. 
ii) [ii gf (a) Sess 
iii) lim, ,,- f(#) = —oo. 
4.5.2. Prove that lim,_,o+ aa = OO. 
4.5.3. Prove that lim,_,9- - = —OO. 
4.5.4. Referring to Theorem 4.5.7, prove the scalar rule, the sum/difference rule, the 
product rule, and the quotient rule 
4.5.5. Let g(x) = 1/27, fi(x) = 2°, fo(x) = x7, f3(x) = 1727, fa(x) = 2. It is easy to see 
that lim, .9'9(2) = 00, lim, 49 fie) = lim, ¢:fo(a) = limy so fa(e) = ims 6-f4(2) = 0: 
i) Compute lim,_,0(fx(x)g(x)) for k = 1, 2,3. 
ii) Justify the first “not enough information” in the product rule in Theorem 4.5.7. 
iii) Justify the second “not enough information” in the product rule in Theorem 4.5.7. 
4.5.6. Justify the three “not enough information” in the quotient rule in Theorem 4.5.7. 


(Hint: Study the previous exercise.) 


4.6 Limits at infinity 


190 Chapter 4: Limits of functions 


Definition 4.6.1. Leta ¢ Rand LEC. 

For f : (a,co) > C, we say that limz+4. f(x) = L if for every € > 0 there exists a 
real number N > 0 such that for all x > N, |f(x) — L| <. 

For f : (—oo,a) > C, we say that lim, ._. f(x) = L if for every « > 0 there exists 
a real number N < 0 such that for all x < N, |f(x) — L| <e. 


Below we switch to real-valued functions. 


For f : (a,0o) > R, we say that limy-,.. f(x) = 00 if for every M > 0 there exists a 
real number N > 0 such that for allax > N, f(x) > M. We say that lim, .. f(x) = —oo 
if for every M < 0 there exists a real number N > 0 such that for alla > N, f(x) < M. 

For f : (—oo,a) > R, we say that lim;-,-.. f(x) = oo if for every M > 0 there exists 
areal number N < 0 such that for alla < N, f(x) > M. We say that lim, ,—. f(x) = —oo 
if for every M < 0 there exists a real number N < 0 such that for alla < N, f(x) < M. 


Example 4.6.2. limz...(x° — 162+) = o. 


Proof. Let M > 0. Set N = max{17,M1/4}. Then for all x > N, 
x — 16x* = x*(x — 16) 
> x* (because x > N > 17) 
> N* (because 2 > N) 
> (M/*)4 (because N > M1/4) 
= M. 


: 2° —1624 
Example 4.6.3. lim; ... “s7p7 = 


Proof. Let € > 0. Set N = max{1,20/e}. Then for all x > N, 


x? — 1624 | 2° — 1624 — x° — 4a? 


e+ 4e2 x + 4x2 
—1624 — 4x? 
x + 4x2 
1624 + 4x? 
x +49 
16x* + 4x? 
= orate (because x > N > 0) 
x x 
4 2 
< 16x < 
x 
4 4 
Z 16x — 
ic 
2024 
Fee 


(because x° + 4x? > x° > 0) 


(because x > N > 1 so that x? < 2+) 
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20 


a 
20 

W (because x > N) 
Eau N > 20/e) 
< x0 ecause N > 20/e 


= €. 


Exercises for Section 4.6 
4.6.1. Prove that lim,_,..(x? — x) = 00 and that lim,_,_..(47 — x) = oo. 
4.6.2. Prove that limy_,o.(x? — 2) = oo and that lim, ,_..(x3 — x) = —oo. 


4.6.3. Is there a limit rule of the following form: If lim,-,~ f(x) = oo and lim;-, g(x) = 
oo, then lim, ...( f(x) — g(x)) can be determined? 


4.6.4. Prove the following limits: 
ae? — xe 

spat = 3/2 
3a%—4e7+1 _ 


4.6.5. For which rational functions f does lim;_,~ f(x) = 0 hold? Justify. 

4.6.6. For which rational functions f does lim,_,.. f(x) exist and is not too? Justify. 
4.6.7. Let f : (a,co) ~ C and L € C. Prove that lim, .. f(x) = L if and only if 
lim;-+oo Re f(x) = Re L and limz_,.. Im f(z) = Im L. 


4.6.8. Let f : (—co,b) > C and L € C. Prove that limz,-. f(z) = L if and only if 
lim, .—oo Re f(z) = Re £ and lim,_,_.. Im f(z) = Im L. 


Chapter 5: Continuity 


Continuous functions from an interval in R to R are the ones that we can graph with- 
out any holes or jumps, i.e., without lifting the pencil from the paper, so the range of such 


functions is an interval in R as well. We make this more formal below, and not just for func- 


tions with domains and codomains in R. The formal definition involves limits of functions. 
All the hard work for that was already done in Chapter 4, so this chapter, after absorbing 
the definition, is really straightforward. The big new results are the Intermediate value 
theorem and the Extreme value theorem for real-valued functions (Section 5.3), existence 
of radical functions (Section 5.4), and the new notion of uniform continuity (Section 5.5). 


5.1 Continuous functions 


Definition 5.1.1. A function f : A — B is continuous at a € A if for all real numbers 
€ > 0 there exists a real number 6 > 0 such that for all x € A, |x —a| < 6 implies that 
|f(x) — f(a)| < «. We say that f is continuous if f is continuous at all points in its 
domain. 


Much of the time for us A is an interval in R, a rectangle in C, or B(a,r) in R or C, 
et cetera. In the more general case, A may contain a point a that is not a limit point of A. 


Theorem 5.1.2. Let f: A — B bea function anda € A. 
(1) If a is not a limit point of A, then f is continuous at a. 
(2) Ifa is a limit point of A, then f is continuous at a if and only iflimz+q f(x) = f(a). 


Proof. In the first case, there exists 6 > 0 such that B(a,6)N A= {a}. Thus by definition, 
the only x € A with |x — a| < 6 is x =a, whence |f(x) — f(a)| = 0 is strictly smaller than 
an arbitrary positive «. Thus f : A > B is continuous at a. 

Now assume that a is a limit point of A. Suppose that f is continuous at a. We need 
to prove that lim,-4 f(z) = f(a). Let € > 0. Since f is continuous at a, there exists 6 > 0 
such that for all x € A, if |x —a| < 6, then | f(x) — f(a)| < «. Hence for all 0 < |x—a| <6, 
|f(x) — f(a)| < e, and since a is a limit point of A, this proves that lim;-.4 f(z) = f(a). 

Finally suppose that lim,_., f(x) = f(a). We need to prove that f is continuous at 
a. Let € > 0. By assumption there exists 6 > 0 such that for all x € A, if 0 < |~n—a| <6 
then |f(x) — f(a)| < «. If = a, then |f(x) — f(a)| = 0 < e, so that for all x € A, if 
|x —a| <6, then |f(x) — f(a)| < «. Thus f is continuous at a. 
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The next theorem is an easy application of Theorems 4.4.3, 4.4.4, 4.4.5, 4.4.6, 4.4.7, 
4.4.8, and 4.4.9, at least when the points in question are limit points of the domain. When 
the points in question are not limit points of the domain, the results below need somewhat 
different but still easy proofs. 


Theorem 5.1.3. We have: 
(1) (Constant rule) Constant functions are continuous at all points of the domain. 
(2) (Linear rule) The function f(x) = x is continuous (at all real/complex numbers). 
(3) (Absolute value rule) The function f(x) = |x| is continuous (at all real/complex 
numbers). 
(4) (Polynomial function rule) Polynomial functions are continuous (at all real/complex 
numbers). 
(5) (Rational function rule) Rational functions are continuous (at all points in the 
domain). 
(6) (Real and imaginary parts) f is continuous at a if and only if Ref and Im f are 
continuous at a. 
Suppose that f : A — C is continuous at a € A. Then 
(7) (Scalar rule) For any c € C, cf is continuous at a. 
(8) (Power rule) For any positive integer n, the function that takes x to (f(x))” is 
continuous at a. 
(9) (The composite function rule) If g is continuous at f(a), then go f is continuous 
at a. 

(10) (The composite power rule) If f is composable with itself i.e., if the range of f is 
a subset of the domain of f, then f” = fo fo---of is continuous at a. (Example 
of a function that is not composable with itself: f(x) = Ina.) 

(11) (Restriction rule) For any subset B of A that contains a and in which a is a limit 
point, the restriction of f to B is continuous at a. In other words, if g : B — C is 
defined as g(x) = f(a) for all x € B, then g is continuous at a. (Such g is called 
the restriction of f to B. By this rule and by (1), the constant functions with 
domain a sub-interval in R are continuous. ) 

Now suppose that f,g:A-— C, and that f and g are continuous at a € A. Then 

(12) (Sum/difference rule) f +g and f —g are continuous at a. 

(13) (Product rule) f -g is continuous at a. 

(14) (Quotient rule) If g(a) 40, then f/g is continuous at a. 


The theorem covers many continuous functions, but the following function for exam- 
ple has to be verified differently. 
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Example 5.1.4. Let f : C > C be defined by 


a= g?=4, if Rex > 1; 
—$7°, . if Rez < 1. 


Then f is continuous at 1, because by the polynomial rules, 


li — eek eh a eee eee 
Pe Rs eed - ) 2) 


lim f(#)= lim  —-32?=-3-1?=-3, 
z—1,Rear<l z—>1,Rer<l 


so that lim, 51 f(z) = -3 = —3- 1° = f(1). 


Note however that this function is not continuous at 1 +72: 


li = li 2A) SS (ag) SH A= SA 
ia ae v) Pare ee ) ( 8) a _ 
lm — f(x) = lm —3¢7 =—3-(1+4)? =6- 6i. 

z—1+i,Rer<l x—>1+i,Rer<l 


so that lim,+14; f(a) does not exist. 


Example 5.1.5. The function g : R — R defined as the restriction of the function in 


Example 5.1.4 equals 


WMI) 393 ife <1, 


Since f is continuous at 1, so is g. Explicitly, by the polynomial rules, 


lim g(x) = lim (2? — 4) = 17-4=-3, 


x>1t ait 
lim g(¢) =: lim —3273 = —3-1°=-3, 
zl zl 


so that (by Exercise 4.1.6), limz1 g(x) = —3 = —3-1° = g(1). Thus g is indeed continuous 
at 1. By the polynomial rules, g is continuous at all other real numbers as well. Thus g 
is continuous — even if it is a restriction of a non-continuous function. It is worth noting 
that precisely because of continuity we can write g in the following ways as well: 
eA. at ol: gt A. if a> ds 
a) oe ife<1 { 3a", if oe 1, 
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Example 5.1.6. Let f : (—1,0)U(0,1] > R be defined by 


a Ss Tae, 
io if a SO. 


This function is continuous at all a < 0 (in the 
domain) because near such a the function f equals 
the polynomial/linear function f(#) = «+1. Simi- 
larly f is continuous at all a > 0. Thus f is contin- z 
uous at all points in its domain. 


Exercises for Section 5.1 

5.1.1. Let cE C. 
i) Prove that the function f : C > C given by f(x) = c+ = is continuous. 
ii) Prove that the function g : C > C given by g(x) = cz is continuous. 


5.1.2. Give an e — 6 proof to show that the functions Re, Im : C > R are continuous. 


5.1.3. Give details of the proofs of Theorem 5.1.3. 


5.1.4. The following information is known about functions f and g: 


cfc) tim f(@)_—gle) tim g(a) 
0 1 2 3 4 

if —1 0 5 5 

2 not defined —oco 6 —6 

3 4 4 not defined OO 

4 2 3 3 2 


i) At which points is f continuous? 

ii) At which points is |g| continuous? 

iii) At which points is fg continuous? 
5.1.5. Define a function f : C\{0} > R such that for all non-zero x € C, f(x) is the angle 
of x counterclockwise from the positive real axis. Argue that f is not continuous. 
5.1.6. Let f : AC be a function, and let a € A be a limit point of A. Letg: A B 
be a continuous invertible function such that its inverse is also continuous. Prove that 
lim, +a f(x) = L if and only if lim,_,9(a) f(g” *(z)) = L. 


x, if x is rational; 
0, otherwise. 


5.1.7. Let f : RR be given by f(x) = { 


i) Prove that f is continuous at 0. 
ii) Prove that f is not continuous anywhere else. 
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*5.1.8. (The Thomae function, also called the popcorn function, the raindrop function, 
and more) Let f : R — R be defined as 
- if « = ®, where p € Z and q ¢ N* 
(= and p and qg have no common prime factors; 
0, if x is irrational. 
i) Prove that f is not continuous at any rational number. 


ii) Prove that f is continuous at all irrational numbers. 


The exercises below are a further play on Section 4.3: they modify the definition of conti- 
nuity to get very different types of functions. — The moral is that the order of quantifiers 


and implications is very important! 


5.1.9. (This is from page 1177 of the Edward Nelson’s article “Internal set theory: a new 
approach to nonstandard analysis.” Bull. Amer. Math. Soc. 83 (1977), no. 6, 1165-1198.) 
A function f : A > B is suounitnoc at a € A if for all real numbers € > 0 there exists a 
real number 6 > 0 such that for all x € A, |x — a| < € implies that | f(x) — f(a)| < 6. 

i) Prove that if f is suounitnoc at some a € A, it is suounitnoc at every b € A. 

ii) Let f : R* > R be given by f(x) = 1/2. Prove that at every a € R‘, f is 


continuous but not suounitnoc. 


iii) Let f : RR > R be given by f(x) = 1 if z is irrational and f(x) = 0 if x is rational. 


Prove that at every a € R, f is suounitnoc but not continuous. 


5.1.10. A function f : A > B is ticonnuous at a € A if there exists a real number 
6 > 0 such that for all real numbers € > 0 and for all x € A, |x — a| < 6 implies that 
|f(x) — fla)| <e. 
i) Suppose that f is ticonnuous at some a € A. Prove that there exist a real number 
6 >0 and b € B such that for all x € A, |x — a| < 6 implies that f(x) = b. 
ii) Give an example of a continuous function that is not ticonnuous at every a in the 
domain. 
iii) Prove that every function that is ticonnuous at every point in the domain is con- 
tinuous. 
5.1.11. A function f : A > B is connuousti at a € A if for all real numbers € > 0 there 
exists a real number 6 > 0 such that for all x € A, | f(x) —f(a)| < € implies that |x—a| < 6. 
i) Let f : R > R bea constant function. Prove that f is not connuousti at any a € R. 
ii) Let f : RR be given by f(x) = 1 if z is irrational and f(x) = 0 if x is rational. 


Prove that f is not connuousti at any a € R. 
iii) Let f : R > R be given by f(x) = z if z is irrational and f(x) = «+ 1 if z is 
rational. Prove that at every a € R, f is connuousti. 
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5.2 Topology and the Extreme value theorem 
Topology and continuity go hand in hand (see Exercise 5.2.5), but not in the obvious 


way, as the next two examples show. 
(1) If f : A—> B is continuous and A is open, it need not be true that the range of f 


is open in B, even if A is bounded. For example, let f : R- Ror f: (—1,1) >R 
be the squaring function. Certainly f is continuous and R is an open set, but the 
image of f is [0,00) or [0,1), which is not open. 
(2) If f : A— B is continuous and A is closed, it need not be true that the range of 
f is closed in B. For example, let f : R > R be given by f(x) = Ta: This f is 
continuous as it is a rational function defined on all of R. The domain A = R is 


a closed set (and open), but the image of f is (0,1], which is not closed (and not 
open). 
However, if f is continuous and its domain A is closed and in addition bounded, then 
it is true that the image of f is closed. This fact is proved next. 


Theorem 5.2.1. Let A be a closed and bounded subset of C or R, and let f : A > C be 
continuous. Then the range of f is closed and bounded. 


Proof. We first prove that the range is closed. Let 6 be a limit point of the range. We want 
to prove that b is in the range. Since 6 is a limit point, by definition for every positive real 
number r, B(b,r) contains an element of the range (even an element of the range that is 
different from 6). In particular, for every positive integer m there exists 2, € A such that 
f (am) € B(b,1/m). If for some m we have f(x) = b, then we are done, so we may assume 
that for all m, f(am) 4 b. Thus there are infinitely many x,,. As in Construction 3.14.1, 
we can construct nested quarter subrectangles R,, that contain infinitely many x,,,. There 
is a unique complex number c that is contained in all the R,. By construction, c is the 
limit point of the set of the z,,, hence of A. As A is closed, c € A. But f is continuous 
at c, so that for all « > 0 there exists 6 > 0 such that for all x € A, if |a —c| < 6 then 
|f(a) — f(c)| < €/2. In particular, for infinitely many large m, |am —c| < 6, so that for 
these same m, |f (am) — f(c)| < €/2. But for all large m we also have |f(xz,) — b| < €/2, 
so that by the triangle inequality, for |f(c) — b| < «. Since this is true for all positive e, it 
follows that f(c) = 6. Thus any limit point of the range is in the range, so that the range 
is closed. 

Next we prove that the range is bounded. If not, then for every positive integer 
m there exists x, € A such that |f(vm)| > m. Again we use Construction 3.14.1, and 
this time we construct nested quarter subrectangles R,, that contain infinitely many of 
these x,,. As before, there is a unique complex number c that is contained in all the R,, 
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and in A, and as before (with « = 2), we get that for infinitely many m, |f(am)—f(c)| < 1. 
But |f(am)| > m, so that by the reverse triangle inequality, 


IF) 2 [F(am) — (Fem) — F(e))| 2 |F (am) - |F(am) — (| > m-1 


for infinitely many positive integers m. Since |f(c)| is a fixed real number, it cannot be 
larger than all positive integers. Thus we get a contradiction to the assumption that the 


range is not bounded, which means that the range must be bounded. 


Theorem 5.2.2. (Extreme value theorem) Let A be a closed and bounded subset of C, 
and let f : A > R be a continuous function. Then there exist l,u € A such that for all 
xeA, 


FU) < f(x) < Fl). 
In other words, f achieves its maximum value at u and its minimum value at I. 
Proof. By Theorem 5.2.1, the range { f(a) : « € A} of f is a closed and bounded subset 


of R, so that its infimum L and supremum U are real numbers which are by closedness in 
the range. Thus there exist u,/ € A such that L = f(l) and U = f(u). 


Example 5.2.3. The function f : [-2,2] — R given by f(x) = 2? — 6 + 5 achieves a 
minimum and maximum. We can rewrite the function as f(x) = (x — 3)? — 4, from which 
it is obvious that the minimum of the function is achieved at 3 — but wait a minute, this 
function is not defined at 3 and hence cannot achieve a minimum at 3. Here is a correction: 
the quadratic function (2 —3)? —4 achieves its minimum at 3 and is decreasing on (—o00, 3), 
so that f achieves its minimum on [—2, 2] at —2 and its maximum at 2. 

Exercises 5.2.2 and 5.2.3 below give examples of continuous invertible functions whose 
inverses are not continuous. The next theorem contains instead some positive results of 


this flavor. 


Theorem 5.2.4. Let F = R or F = C. Let A,B be subsets of F’, let A be closed and 
bounded, and let f : A—+ B be continuous and invertible. Then f—' is continuous. 


Proof. We need to prove that f~! is continuous at every b € B. Let € > 0. Set a= f—1(b). 
The set B(a,¢) is open, so its complement is closed. Therefore C = A \ B(a,e) = AN 
(F \ B(a,e)) is closed. As C C A, then C is also bounded. Thus by Theorem 5.2.1, 
{ f(a): a € C} is a closed and bounded subset of F’. By injectivity of f, this set does not 
contain b = f(a). The complement of this set contains b and is open, so that there exists 
6 > 0 such that B(b,d) C F\ {f(a) : « € C}. Now let y € B with |y — b| < 6. Since f 
is invertible, there exists x € A such that y = f(x). By the choice of 6, x ¢ C, so that 
f-\(y) =2 € B(a,e) = B(f71(b),€). In short, for every € > 0 there exists 5 > 0 such that 
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for all y € B, if |y —b| < 6, then | f~!(y) — f~!(a)| < €. This proves that f~! is continuous 
at b. 


Exercises for Section 5.2 


5.2.1. (Compare with Theorem 5.2.1.) 
i) Show that f : (0,1) > R defined by f(x) = 4 is a continuous function whose 


domain is open and bounded but the range is not bounded. 
ii) Show that f : (1,1) — R defined by f(x) = x? is a continuous function whose 
domain is open and bounded but the range is not open. 
5.2.2. Consider the continuous function f in Example 5.1.6. 
i) Prove that f has an inverse function f~! : (—1,1) > (—1,0) U (0, 1]. 
ii) Graph f~! and prove that f~! is not continuous. 


iii) Compare with Theorem 5.2.4. 
5.2.3. Define f: {x € C: |z| < 1 or |x| > 2} > C by 


fe)={4 if fel < 13 


5x, otherwise. 


Prove that f is continuous and invertible but that f~! is not continuous. Compare with 
Theorem 5.2.4. 


5.2.4. Let A be a closed and bounded subset of C, and let f : A + C be continuous. 
i) Why are we not allowed to talk about f achieving its maximum or minimum? 
ii) Can we talk about the maximum and minimum absolute values of f? Justify your 
answer. 
iii) Can we talk about the maximum and minimum of the real or of the imaginary 
components of f? Justify your answer. 
*5.2.5. Let F be R or C, and let A and B be subsets of F. Prove that f : A > B is 
continuous if and only if for every open subset U of F' there exists an open subset V of F 
such that the set {x € A: f(x) EU} =VOA. 
5.2.6. Let f : (0,1) > R be defined by f(x) = 4. Prove that f is continuous and that f 
has neither a minimum nor a maximum. Explain why this does not contradict the Extreme 


value theorem (Theorem 5.2.2). 


5.2.7. (Outline of another proof of the Extreme value theorem for closed intervals in R, due 
to Samuel J. Ferguson, “A one-sentence line-of-sight proof of the Extreme value theorem” , 
American Mathematical Monthly 121 (2014), page 331.) Let f : [a,b] — R be a continuous 
function. The goal is to prove that f achieves its maximum at a point u € [a,b]. Set 
DL = {z € [a,b]: for all y € [a, 5], if y < x then f(y) < f(x)}. 
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i) Prove that a € L. 


ii) Let c € Bd(L) (boundary of L). Prove that c € [a, }}. 


iii) Suppose that c ¢ L. 


— Prove that there exists y € [a,c) such that f(y) > f(c). 

— Prove that there exists 6 > 0 such that for all x € [a,b] MN B(c, 6), | f(x) — f(c)| < 
(f(y) — f(c))/2. (Hint: use continuity at c.) 

— Prove that there exists x € LM B(c, min{d,c — y}). 


— Prove that f(y) — f(x) > 0. 


— Prove that y > x. 


— Prove that c— y > |x —c| >c-— a >c-—y, which is a contradiction. 
iv) Conclude that c € L and that L is a closed set. 


v) Let wu = sup(Z). Prove that u exists and is an element of L. 
vi) Let k > f(u). Prove that the set S, = {x © |c,b] : f(y) => k} is closed, and in 
particular that 5S;,, if non-empty, has a minimum. 


vii) Suppose that for some x € [a,b], f(x) > f(u). 
— Prove that x >uand a ¢ L. 
— Prove that there exists x, € (u,x) such that f(x) > f(x). 
— Prove that there exist x1 € (u, x), v2 € (u, 21), £3 € (u, 22), ..., In E (U, Ln_1), 
vg such that - Sf (a),) > Fact) te} SF is) F(a) Sf aa) ta) 


flu). 


— Prove that the set Sy, does not have a minimum. 


viii) Conclude that f(u) is the maximum of f on |a, }j. 


5.2.8. Let A be a subset of R, and let f : A — R be a function with a maximum at 
u € A. What can you say about the slope of the line between (a, f(a)) and (u, f(w)) for 


ae A\{u}? 


5.3 Intermediate value theorem 


In this section, all functions are real-valued. The reason is that we can only make 


comparisons in an ordered field, and 
(Exercise 3.9.6). 


R is an ordered field (Theorem 3.8.2), and C is not 
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Theorem 5.3.1. (Intermediate value theorem) Let a,b € R with b > a, and let 
f : [a,b] > R be continuous. Let k be a real number strictly between f(a) and f(b). Then 
there exists c € (a,b) such that f(c) =k. 


Here is a picture that illustrates the Intermediate value theorem: for value k on the y- 
axis between f(a) and f(b) there happen to be two c for which f(c) = k. The Intermediate 
value theorem guarantees that one such c exists but does not say how many such c exist. 


Proof of Theorem 5.3.1: Set a9 = a and bo = b. Apply Construction 3.14.1 of halving 
intervals, with the property P of intervals being that k is strictly between the values of f 
at the two endpoints. If for any n > 0, f =) = k, then the theorem is proved and 
we can immediately stop the proof (and the construction). 

When k; is not equal to a then since k is between f(an—1) and f(bn_1), 
necessarily k is between f(a,_1) and f(a feet) or else between f( and 
f(bn—1). Choose that half [an, bn] of [@n—1,bn—1] which says that k is strictly between 
f(an) and f(b,). Thus for all n, k is strictly between f(a,) and f(b,). By construction 
c = sup{a1, G2, a3,...} = inf {by, bo, b3,...} is in [a,b]. So c is in the domain of f. 

We will prove that for all e > 0, |f(c) —k| < «. Let € > 0. Since f is continuous, 
it is continuous at c, so there exists 6 > 0 such that for all x € [a,b], if ja —c| < 6, 
then |f(x) — f(c)| < €/3. By Exercise 3.8.3, there exists a positive integer n such that 
1/2” < 6/(b—a). As an <c < by, we have ja, —c| < jan — bn| = (b — a)/2” < 6, so 
that |f(an) — f(c)| < €/3. Similarly, | f(b.) — f(c)| < €/3. Hence by the triangle inequality, 
|f(bn) — f(an)| < 26/3. But k is between f(a,) and f(b,), and thus both |f(a,) — k| and 
|f(b,) — k| must be less than 2€/3. Therefore 


If(o) — kl < |f(e) — Flan)| + [fF (an) — kl <e. 
Since this true for all « > 0, it follows by Theorem 2.10.4 that f(c) =k. 


n—- tbn— 
a 5 1) 


An important application of this theorem is in the next section, introducing the 
radical functions. (So far we have sporadically used the square roots only, relying on some 
facts from high school that we have not yet proved.) 
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c7—2 
c2+2° 


Example 5.3.2. There exists a real number c such that c? — 4 = 


Proof. Let f : R + R be defined by f(x) = 2° —4— ae This function is a rational 
function and defined for all real numbers, so that by Theorem 5.1.3, f is continuous. Note 
that f(0) = -3 < 0 < f(2), so that by Theorem 5.3.1 there exists c in (0,2) such that 


c7-2 
f(c) =0. In other words, c? — 4 = S55. 


Theorem 5.3.3. Let I be an interval in R, and let f : I — R be continuous. Then the 
image of f is an interval in R. 


Proof. For any c and d in the image of f, by the Intermediate value theorem (The- 
orem 5.3.1), any real number between c and d is in the image of f, which proves the 


theorem. 


However, if f : A > B is continuous and injective and A is open, then the range of 
f is open in B. We prove this first for A and B subsets of R, and then for subsets of C. 


Compare the next theorem to Example 5.1.6. 


Theorem 5.3.4. Let I be an interval in R, B a subset of R, and let f : I > B be 
continuous and invertible. Then f~' is continuous and f,f~' are either both strictly 


increasing or both strictly decreasing. Furthermore, if I is open, so is B. 


Proof. Let a < b be in I. Since f has an inverse, f(a) 4 f(b), so that by trichotomy, either 
f(a) < f(b) or f(a) > f (0). 

For now we assume that f(a) < f(b). With that we prove that f is an increasing 
function, i.e., that for any x,y € I, if x < y then f(x) < f(y). First suppose that 7 < y < z 
are in J. By invertibility, f(x), f(y), f(z) are distinct. If f(x) is between f(y) and f(z), then 
an application of the Intermediate value theorem gives c € (y,z) such that f(c) = f(z). 
But « < y <c, so x and care distinct, and f(c) = f(x) contradicts invertibility of f. So 
f(x) is not between f(y) and f(z), and similarly f(z) is not between f(x) and f(y). Thus 
necessarily f(x) < f(y) < f(z) or f(x) > fly) > f(z). By setting x = a and z = b we 
get that f is increasing on [a,b], by setting y= a and z = b we get that f is increasing on 
IN (—ov, a], and by setting x = a and y = b we get that f is increasing on JM [b, oo). Thus 
f is increasing on I. 

By Theorem 5.3.3 we know that B is an interval. By definition of inverses, f~' is 
strictly increasing. 

If J is in addition open, let c € B and a = f~‘(c). Since J is open and f is 
increasing, there exist b1,b2 € A such that b) < a < bg. Thus f(bi) < f(a) =c < f(b), 
and by Theorem 5.3.3 we know that (f(b1), f(b2)) is an open subset of B that contains c. 
Thus B is open. 
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Now we prove that f~! is a continuous function at an arbitrary y be in the domain 
of f-?. Let e > 0. Let c= f(y). Let z € (c,c +e) MI. (Of course, if c is the right 
endpoint of J, there is no such z.) Since f is increasing, y = f(c) < f(z) and we set 
6, = f(z) —y. Then for any z € (y,y+ 61) = (f(c), f(z)), by the Intermediate value 
theorem z is in the range of f, i.e., x is in the domain of f~', and since f~' is increasing, 
ea f Gy) < fe) = £-G@)). = &.<ee. “Thus |f-4G@)— {7 @)| =e “This 
proves that for the given « > 0 there exists 6; > 0 such that for any x € (y,y+01) NJ, 
|f-(x) — f-t(y)| < e. Similarly, there exists 62 > 0 such that for all x € (y — 62,y) NI, 
lf-(y) — f-\(2)| < e. Now set 6 = min{d,, 52}. By what we just proved, for all x in the 
domain of f~1, if jz — y| < 6, then |f~!(x) — f-!(y)| < e. Thus f~! is continuous at y, 
and since y was arbitrary, f~! is continuous. 

Finally, suppose that f(a) > f(b) for some a < bin I. Set g = —f. Then g is 
invertible and continuous, g(a) < g(b), and by the work so far we know that g is a strictly 
increasing function with a continuous strictly increasing inverse g~!. Thus f = —g is a 


strictly decreasing function, and f~! = —g~! is a continuous strictly decreasing function. 


In addition, if J is open, so is B. 


Exercises for Section 5.3 


5.3.1. Let f(z) = 2° +4, g(x) = x4 +27. Prove that there exists c € [—4,4] such that 
fo =g9(©). 


5.3.2. Let f : R— Q be a continuous function. Prove that f is a constant function. 


i) Prove that any continuous function from C to Q is constant. 


ii) Does a continuous function f : Q— R have to be constant? 


5.3.3. (Fixed point theorem) Let f : [a,b] — [a,b] be a continuous function. Prove that 
there exists c € [a, b] such that f(c) =. 


5.3.4. Find real numbers a < b and a continuous function f : [a,b] > [a,b] such that 
f(c) #¢ for all c € (a,b). 


5.3.5. Find real numbers a < 6 and a continuous function f : (a,b) > (a,b) such that 
f(c) #¢ for all c € (a,b). 
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5.3.6. The goal of this exercise is to prove that every polynomial of odd degree has a real 
root. Write f(X) = ag +a,X +a2X7+-+-+a,X" for some real numbers ag, a1, 42,..., An 
such that n is odd and ay # 0. Set b= 7H max{|ao|, |ai|,---, |@n—1], |@n|}- 
i) Prove that b is a positive real number. If f(b) = 0 or if f(—b) = 0, we have found 
the root. So we may assume that f(b) and f(—b) are not zero. 
ii) Justify all steps below: 


|ao + a,(+b) + a2(+b)? Baas Gn—1(+b)""+| 


< |ao| + |arb| + |agb?| +--+ + Jan_1b"—*| 
(by the triangle inequality) 
nd |b2 7|b° alee 

= z a: | fa | area (by the choice of b) 
n n n 
n|b =. 

a a Gey ee eee a hae 
n 
n|D x 

=< ela Be Ge Be Feats ty (because b > 1) 
m 0) 1 2 n—-1 (place markers) 
n 

= |a,,|b” 

= |ay,0 | 


iii) Justify all steps below: 
| f(£b)| = |ao +1 (£0) +02 (4b)? +--+ +@n—1(£b)"*+an(£0)"| 
> |an(+b)"|—|ag+ay(+b)+a2(+b)?+---+an—1(+b)"—"| 
= 0. 
iv) Prove that f(b) has the same sign (positive or negative) as a,b” and that f(—b) 
has the same sign as a,(—b)”. 


v) Prove that f(b) and f(—b) have opposite signs. 
vi) Prove that f has a real root in (—b,b). 


5.4 Radical functions 


Let n be a positive integer. Define the function f(x) = x” with domain R if n is 
odd and domain Ryo otherwise. In this section we re-prove Theorem 3.8.10 that the nth 


radical function exists, and more. 
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Theorem 5.4.1. The range of f is R ifn is odd and is Ryo otherwise. 


Proof. Certainly 0 = 0” is in the range. If a > 0, then 
0° =0<a<a+1<(a+1)”. 


The last inequality is by Exercise 2.8.1. Since exponentiation by n is a polynomial function, 
it is continuous, and so by the Intermediate value theorem (Theorem 5.3.1), there exists 
r € (0,a+1) such that a =r”. Thus every non-negative real number is in the range of f. 
If a < 0 and n is odd, then similarly 


(a—1)"<a-1<a<0=0". 


The Intermediate value theorem guarantees that there exists r € (a—1,0) such that a=r”. 
So for odd n all real numbers are in the range. If n is even, then n/2 is an integer and 


for any « € R, «” = (x?)"/? is a positive-integer power of x7. By Theorem 2.7.12, 2? is 
positive, and so by Theorem 2.9.2, 2” € R* U {0}. 


We now re-prove Theorem 3.8.10 that the nth radical function exists when n is a 
positive integer. Let f be as in the theorem above. By Theorem 2.9.2 and Exercise 2.9.1, 
f is strictly increasing. Thus by Theorem 2.9.4, f has an inverse function f~'. By the power 
rule, f is continuous, so that by Theorem 5.3.4, f~! is strictly increasing and continuous. 
We call this inverse the nth radical function. For any a in its domain we write f(a) = Va 
or as f(a) = a!/”, and we call this value as the nth root of a. 


We just established that for any positive integer n, </ is defined on Rso, or even on 


R if n is odd. We also just established that ~/ is a strictly increasing continuous function. 


By continuity we immediately get the following: 


Theorem 5.4.2. (Radical rule for limits) For any positive integer n and any a in the 
domain of ~/ , 


lim Vx = Ya. 


ra 


Now let m and n be positive integers. For any x € R* U{0}, the elements (/x™ and 
(%/x)™ are well-defined in R* U {0}. Their nth powers are the same value 2”: 


(Var) aa" 


by definition, and 


Thus by uniqueness established in Corollary 2.9.3, 


Var = (Ya) 
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We record this function as "/%// = ( )'/”, and call it the exponentiation by m/n. This 
shows that we can handle rational exponents as well. 


Theorem 5.4.3. (Exponentiation by rational exponents is continuous.) Let r € Q. 
If r > 0 let A = Ryo and if r < 0 let A=R*. Let f : A— A be defined by f(r) = 2". 
Then f is continuous. 

(1) Ifr > 0, then f is strictly increasing. 

(2) Ifr <0, then f is strictly decreasing. 

(3) Ifr #0 then f is invertible and its inverse is exponentiation by 1/r. 

(4) Ifr =0, then f is a constant function. 


Proof. Write r = m/n, where n and m are integers and n 4 0. Since m/n = (—m)/(—n), 
by possibly multiplying by —1 we may assume that n > 0. Then f is a composition of 
exponentiation by m with exponentiation by 1/n, in either order. Exponentiation by non- 
negative m is continuous by the constant or power rule, exponentiation by negative m is 
continuous by the quotient rule, and exponentiation by 1/n is continuous by Theorem 5.4.2. 
Thus f is continuous by the composite rule. 

If r > 0, then m,n > 0, and then f is the composition of two strictly increasing 
functions, hence strictly increasing. If r < 0, then m < 0 and n> 0, so f is the composition 
of a strictly increasing and a strictly decreasing function, hence strictly decreasing by 
Exercise 2.9.7. 

For any non-zero integer p, let gp, hp» : A — A be defined as g,(x) = x? and h,(x) = 
z'/P, By associativity of function composition (Exercise 2.4.6), 


(f(a))"/™ = Bm © 9n(f(a)) 
=Nm © gn(hn © Im(a)) 
= (Rm © (Gn 2 hn))(Gm(@)) 
=i OG 4, (8) 
=a, 
and similarly f(a"/™) = a. This proves that exponentiation by n/m = 1/r is the inverse 
of f. 
The last part is obvious. 


Theorem 7.6.4 shows more generally that exponentiation by arbitrary real numbers 
(not just by rational numbers) is continuous. 


Exercises for Section 5.4 
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5.4.1. Let n,m € Q, and suppose that a and 6 are in the domain of exponentiation by n 
and m. Prove: 

i) a” -b" = (a ve 
ii) . an =a 

iii) a Sere 
iv) Ifa i 0, then a~” = 1/a”. 


1 


5.4.2. Determine the following limits, and justify all steps by invoking the relevant theo- 
rems/rules: 

i) Tim, Va? — 3244. 
mn MBa V2 


an gt 4A : 
x—2 


5.4.3. Let f(x) = /z -— V—-2. 
i) Prove that for all a in the domain of f, lim,-,, f(x) is inapplicable. 


ii) Prove that f is continuous. 
5.4.4. Determine the domain of the function f given by f(x) = /—2?. 


5.4.5. Here is an alternate proof of Theorem 5.4.2. Study the proof, and provide any 
missing commentary. Let A be the domain of v/ . 
i) Prove that an element of A is a limit point of A. 
ii) Suppose that a = 0. Set 6 = €”. Let x € A satisfy 0 < |x — a] < 6. Since the nth 
root function is an increasing function on A, it follows that 


| Ye — Ya| =| 7/2] = Vial < Vo=e. 
iii) Suppose that a > 0. First let &’ = min{e, Ya}. So é’ is a positive number. Set 

6=min{(e+ v/a)” —a,a—(V/a" —)"}. Note that 

(e+ Va)" -—a>(Va)"—-a=0, 

0 < Va — < Va, 

a —(Va" —')" >0, 
so that 6 is positive. Let x € A satisfy 0 < |x —a| < 6. Then —6 <2—a< 6 and 
a-—d<xz<6d+a. Since 6 = min{(e+ %/a)” —a,a—(%/a" — e)"}, it follows that 


a—(a-(Va"—e)") <a-d<a<atd<at((e+ Ya)" —a), 


or in other words, (%/a — €')” <a < (e+ Wa)”. Since the nth radical function is 
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increasing on R*, it follows that 
Ya—e< Ya-e < Ya<et+ Va, 


whence —€ < ~/x — Wa < «, so that | */x — Wa] < «, as desired. 
iv) Assume that a is negative. Then necessarily n is an odd integer. By what we have 


proved for positive numbers in the domain, such as for —a, there exists 6 > 0 such 
that for all x € R, if 0 < |(—x) — (—a)| < 6, then | 7/—x — Y—a] < e. But then for 
x € R with 0 < |x —a| < 0, since n is odd, 


| Ve — Wal =|(-1) (We - Va)| =|V—2—- Y-al <e. 


5.4.6. Recall from Exercise 3.12.4 that for every non-zero complex number a there exist 


exactly two complex numbers whose squares are a. Let’s try to create a continuous square 
root function f : C > C. (We will fail.) 

i) Say that for all a in the first quadrant we choose f(a) in the first quadrant. Where 
are then f(a) for a in the remaining quadrants? 

ii) Is it possible to extend this square root function to a function f on all of C 
(the positive and negative real and imaginary axes) in such a way as to make 
lim, +4 f(x) = f(a) for alla € C? 

iii) Explain away the problematic claims /—4- /—9 = \/(—4)(—9) = V36 = 6, 
J/—4-/—9 = 21 - 34 = —6 from page 9. 

iv) Let D be the set of all complex numbers that are not on the negative real axis. 


Prove that we can define a continuous square root function f : D— C. 

(v)* Let 6 be any real number, and let D be the set of all complex numbers whose 
counterclockwise angle from the positive real axis is 6. Prove that we can define a 
continuous square root function f : D— C. 

5.4.7. (The goal of this exercise and the next one is to develop exponential functions 
without derivatives and integrals. We will see in Section 7.4 that derivatives and integrals 
give a more elegant approach.) Let c € (1,00) and let f : Q — R™ be the function 
fase: 


i) Why is f a function? (Is it well-defined, i.e., are we allowed to raise positive real 


numbers to rational exponents?) 
ii) Prove that f is strictly increasing. (Hint: Theorem 5.4.3.) 
iii) Let « > 0. Justify the following: 
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Use the Archimedean property of R (Theorem 3.8.3) to prove that the set {(e+1)” : 
n € No} is not bounded above. 


iv) Prove that there exists a positive integer n such that ea eae oO 

v) Prove that there exists 6; > 0 such that for all x € (0,61) NQ, |e”? —1] <. 
vi) Prove that there exists 62 > 0 such that for all x € (—d2,0)NQ, |c” —1| <«. 
vii) Prove that limz_,o,2e«qc” = 1. 
) 


vili) Prove that for any r € R, limz-,;,reqc” exists and is a real number. 


5.4.8. (Related to the previous exercise.) Let c € R*. 


i) Prove that for any r € R, lim,_,,,,eqc” exists and is a real number. (Hint: Case 


c = 1 is special; case c > 1 done; relate the case c < 1 to the case c > 1 and the 
quotient rule for limits.) 

ii) We denote the limit in the previous part with c”. Prove that for all real numbers 
C,C1,C2,7,71,72, With c,c1,c2 > 0, 


= 1 
i (ie) =COoe Se eae 


ae 


iii) Prove that the function g : R > R given by g(x) = c” is continuous. (This is easy.) 


5.5 Uniform continuity 


Definition 5.5.1. A function f is uniformly continuous if for all real numbers € > 0 
there exists a real number 6 > 0 such that for all x and y in the domain, if |x — y| < 6, 


then |f(x) — f(y)| <«. 


For example, constant functions are uniformly continuous. 

In uniform continuity, given a real number ¢ > 0, there exists 6 > 0 that depends 
only on € that makes some conclusion true, whereas in the definition of continuity at a, 
given a real number ¢ > 0, there exists 6 > 0 that depends on ¢€ and on a for the same 
conclusion to be true (with y = a). Thus the following is immediate: 


Theorem 5.5.2. Every uniformly continuous function is continuous. 


The converse is false in general: 
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ot 
f—ts 
n 


Example 5.5.3. Let f(x) = x7, with domain C. Since f is a polynomial function, i 


continuous. Suppose that f is also uniformly continuous. Then in particular for € 
there exists 6 > 0 such that for all x,y € A, if |a—y| < 6 then | f(x) — f(y)| < 1. Set x 
and y=a+ 2. Then |z — y| = 6/2 < 6 and 


Ole pe 


If(z) — f@)| = |2? -y"| = 


5? 52 
— =] — 1= 7 
s+ S| aoe > € 


This proves the negation of the definition of uniform continuity. 


The converse of Theorem 5.5.2 holds with some extra hypotheses. 


Theorem 5.5.4. Let A,B be closed and bounded subsets of R or C. Let f : A + B be 
continuous. Then f is uniformly continuous. 


Proof. Let « > 0. Since f is continuous, for each a € A there exists 6, > 0 such that for 
all c € A, |x — a] < dq implies that |f(x) — f(a)| < €/2. Note that A C UseaB(a, dq). 
By Theorem 3.14.5 there exists 6 > 0 such that for all 2 € A there exists a € A such that 
B(a,6) C Bia, dq). Let x,y € A with |x—y| < 6. Since z, y € B(x, 5) C B(a, dq), it follows 
that |x — al, |y —a| < 6,. Thus 


If(@) — Fy) = [F(@) — fla) + F(@) — Fy)! 


< |f(x) — f(a)|+ | f(a) — f(y)| (by the triangle inequality) 
<¢€/2+6/2=€. 


Example 5.5.5. The continuous function \/ is uniformly continuous. 


Proof. We established in Section 5.4 that ./ is continuous. Let « > 0. We divide the 
domain into two regions, one closed and bounded so we can invoke the theorem above, and 
the other unbounded but where \/- has a bounded derivative. 

The first region is the closed and bounded interval [0,2]. By the previous theorem 
there exists 6; > 0 such that for all a,x € (0, 2], if |x — a] < 6) then |,/z — Va] <e. 

The second region is the unbounded interval [1, 00). For a, x € [1, 00) with |x—a] < «€ 


we have 


Nix ve + va r-a 


(because \/x, \/a > 1) 


Iz — al 
< 
-_ 2 
SIGs 


Finally, set 6 = min{6,,¢,1}. Let a and x be in the domain of \/ such that |x—a] < 6. 
Since |x — a| < 6 < 1, necessarily either x,a € [0,2] or z,a < [1,00). We have analyzed 
both cases, and we conclude that |,/x — V/a| < e. 
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Theorem 5.5.6. Let f : A — B be uniformly continuous and C a subset of A. Let 
g:C > B be defined as g(x) = f(x). Then g is uniformly continuous. 


Proof. Let « > 0. Since f is uniformly continuous, there exists 6 > 0 such that for all 
x,y € A, if |x — y| < 6 then |f(x) — f(y)| < ¢. But then for any z,y € C, if |x —y| < 6, 
then |g(a) — g(y)| = f(a) — fl <e. 


Example 5.5.7. By Example 5.5.3 the squaring function is not uniformly continuous on 


C or R, but when the domain is restricted to any bounded subset of C, that domain is a 
subset of a closed and bounded subset of C, and so since squaring is continuous, it follows 
by the previous theorem that squaring is uniformly continuous on the closed and bounded 
set and hence on any subset of that. 


Exercises for Section 5.5 
5.5.1. Let f :C > C be given by f(x) = mx+1 for some constants m,/. Prove that f is 
uniformly continuous. 


5.5.2. Which of the following functions are uniformly continuous? Justify your answers. 
i) f: BOO,1) > C, f(x) =2". 

i) f: 0,1] >R, fla Vane 

i) 

) 


li 


f:ROR, f(@)= 
f:R\{0} ~R, F(a) 
f:ROR, fa) = | Fr ee 


iii 


+1° 
—. ee 


iv : 
|| 


0, otherwise. 


v) 

1, if x is rational; 
vi) f: R>R, f(z) = Ma otherwise. 
vii) Re, Im: C>R. 


viii) The absolute value function from C to R 


5.5.3. Suppose that f,g:A— C are uniformly continuous and that c € C. 
i) Prove that cf and f +g are uniformly continuous. 
ii) Is f - g uniformly continuous? Prove or give a counterexample. 
5.5.4. Let f : (a,b) + C be uniformly continuous. Prove that there exists a continuous 
function g : [a,b] > C such that f(x) = g(x) for all x € (a,b). 
5.5.5. Is the composition of two uniformly continuous functions uniformly continuous? 


Prove or give a counterexample. 
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5.5.6. Let f :C — C be defined by f(x) = 2°. 
i) Prove that f is continuous but not uniformly continuous. 
ii) Find a uniformly continuous function g : C > C such that go f is uniformly 
continuous. 
iii) Find a uniformly continuous function g : C > C such that go f is not uniformly 


continuous. 


Chapter 6: Differentiation 


The geometric motivation for differentiation comes from lines tangent to a graph of 
a function f at a point (a, f(a)). For example, on the graph below are two gray secant 
lines through (a, f(a)): 


ae 


— 


It appears that the line through (a, f(a)) and (z, f(x)) is closer to the tangent line to the 
graph of f at (a, f(a)) if x is closer to a. Intuitively, the slope of the tangent line is the 
limit of the slopes of the secant lines. 


6.1 Definition of derivatives 


Definition 6.1.1. Let A CC, and let a € A be a limit point of A. A function f : A—> C 
is differentiable at a if 

vam £2) =f) 

im —7——— 


mrL—-a x—-a 


exists. In this case, we call this limit the derivative of f at a, and we use either Newton’s 
df (x) 


notation f'(a) = Gaye. or Leibniz’s notation 4 (a) = ae lees 


A function is differentiable if it is differentiable at all points in its domain. 


An alternative way of computing the derivatives is 


pay = #O| tim Lot -F@ 


dx peas h->0 h 


as this is simply a matter of writing x as a+ h, and using that a+h=2—- a if and only 
if h > 0. (See Exercise 5.1.6.) 
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Example 6.1.2. Let f(x) = ma +1, where m and / are complex numbers. Then for any 


aeC, 
_ f(x) — f(a) 
— | 
f(a) = im —— 
i (ma +1) — (ma+1) 
ra xX—-a 
im Oe a) 
ra xX—-a 
=i: 
Alternatively, 
ist et AG ER) LG) 
f (a) = es h 
eh (m(a +h) +12) — (matt) 
 h-¥0 h 
ieee 
~ oo A 
= 


Alternatively, 


Section 6.1: Definition of derivatives 
= 2a. 
From now on, we mostly use the alternative notation. 


Example 6.1.4. Let f(x) = 1/x. Then 


(common denominator in the fractions) 


—1 
lim. ————_~ 
h+0 a(a + h) 
—1 
Qe 
by the quotient rule for limits. 


Example 6.1.5. Let f(a) = ,/z. The domain of f is Ryo, and for all a > 0, 


f(a) = jim HOt Fle) 
ae Vat+th—v/a 
= 0 h 
= Vath-Ja Vath+vVa 
bh h Jath+a/a 
= lim UU ek since (x — y)(x Sosa 
eee (c—y)(@t+y) y”) 


h 
= lim 
h0 A(Va+h+ /a) 


1 
= ini — 
h>0 Vath+/a 
1 
D/a 
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by the linear, radical, composite, and quotient rules for limits. It is left to Exercise 6.1.4 


to show that f is not differentiable at 0. 
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Example 6.1.6. Let f(z) = x°/?. The domain of f is Ryo, and for all a > 0, 
fla+h) — f(a) 


, 
eC as st h 
ae (a + h)3/? — a3/? 
h—0 h 
ajies (a + h)3/? — a3/? (a+ h)8/? + a3/2 
h-0 h (a + h)3/2 4+ q3/2 
= lim ah) ea (since (2 — y)(a + y) = x? — y?) 


h—0 h((a + h)3/2 + a3/2) 
ds a? Ba" ht Sah? hs = a2 

hoo A((a + h)3/? + 3/2) 
meee (3a? + 3ah + h?7)h 

h—0 h((a + h)3/2 + a3/2) 
eee Sa? 4 Sah he 

ho (a + h)3/2 + @3/2 


2 ‘. 
EET ) if a > 0; 
him = lint = 0. ikea = 0; 
h>0 h3/2 h—0 : , 
3 
sie 2 
2 
by the linear, radical, composite, and quotient rules for limits. (Note that this f is differen- 
tiable even at 0, whereas the square root function (previous example) is not differentiable 


at 0.) 


Note that in all these examples, f’ is a function from some subset of the domain of 
f to a subset of C, and we can compute f’ at a number labeled x rather than a: 


f(a) — FO — jin FO+M-FO) _ 1, FO- FO 


dx h0 h 22 2-2 


The h-limit is perhaps preferable to the last limit, where it is z that varies and gets closer 
and closer to 2. 


Example 6.1.7. The absolute value function is not differentiable at 0. 


Proof. This function is not differentiable whether the domain is C or R. The reason is 


that the limit of Jal lol as h goes to 0 does not exist. Namely, if h varies over positive real 


numbers, this limit is 1, and if h varies over negative real numbers, the limit is —1, so that 


the limit indeed does not exist, and hence the derivative does not exist. 


This gives an example of a continuous function that is not differentiable. (Any 
continuous function with a jagged graph is not differentiable.) 
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Exercises for Section 6.1 

6.1.1. Prove that f : C > C given by f(x) = x? is differentiable everywhere, and compute 
the derivative function. 

6.1.2. Prove that f : C \ {0} > C given by f(x) = 1/2? is differentiable everywhere, and 
compute the derivative function. 

6.1.3. Prove that f : (0,co) — R given by f(x) = 1/,/z is differentiable everywhere, and 
compute the derivative function. 


6.1.4. Prove that the square root function is not differentiable at 0. 


6.1.5. Prove that the function f : [0,00) > R given by f(x) = 2x°/? is differentiable, 
including at 0. 


6.1.6. Let f : R—R be given by 


_fa2?-1, ifg>1, 
oes it ed, 


Prove that f is differentiable at 1. 
6.1.7. Let f : RR be given by 


pers ata ds 
Oe ee if i. 


Prove that f is continuous but not differentiable at 1. 


6.1.8. Let f : R—R be given by 
fe) = {3 ie SS 1 
= 3 


w—a, ifa<l. 


Prove that f is not continuous and not differentiable at 1. 


ee . sft ty de 1a . ; ee ee 
6.1.9. Determine if the following function is differentiable at 1: f(x) = { oe, ife<1 
Lao ee 
; ee if ¢ > 0; _ Je +3, ifx>0; 
6.1.10. Let f,g: R— Rbe f(z) ee fc <0, and g(x) re fx <0 


i) Prove that f is differentiable everywhere and that for all x € R, f’(x) = |a|. 


ii) Prove that g is not differentiable at 0. 
iii) Prove that g is differentiable at any non-zero real number «x with g’(x) = ||. 


6.1.11. Let f : (0,1) — R be the square root function. Verify that f is differentiable, 
bounded, even uniformly continuous, and that f’ is not bounded. (Hint: Example 6.1.5.) 


6.2 Basic properties of derivatives 
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Theorem 6.2.1. If f is differentiable at a, then f is continuous at a. 


Proof. By definition of differentiability, a is a limit point of the domain of f and a is in the 


f(ath)—f(@) 
h 


domain of f. Furthermore, since limp_,9 exists, by the product rule for limits 


also limp +(hhte-1@ ) exists and equals 0 - f’(a) = 0. In other words, limp,_.9(f(a@ + 
h) — f(a)) = 0, so that by the sum rule for limits, lim,9 f(a + h) = limp+o(f(a + h) — 
f(a) + f(a)) = 0+ f(a) = f(a). Thus by Exercise 5.1.6, lim, ., f(x) = f(a), so that f is 


continuous at a. 


Theorem 6.2.2. (Basic properties of derivatives) 
(1) (Constant rule) Constant functions are differentiable (at all real/complex numbers) 
and the derivative is 0 everywhere. 
(2) (Linear rule) The function f(x) = x is differentiable (at all real/complex numbers) 
and the derivative is 1 everywhere. 


Let A be a subset of C and let a € A be a limit point of A. Suppose that f,g: A — C are 
differentiable at a. Then 
(3) (Scalar rule) For any c € C, cf is differentiable at a and (cf)'(a) =c 
(4) (Sum/difference rule) f + g is differentiable at a and (f + 9)'(a) = f'(a 


)4g'(a). 
(5) (Product rule) f - g is differentiable at a and (f - g)'(a) = f'(a)g(a) + f(a)g’(a). 
(6) (Quotient rule) If g(a) # 0, then f/g is differentiable at a and (f/g)/(a) = 


f'(a)9(a)— f (a)g" (a) 
(g(a))? : 


Proof. Parts (1) and (2) were already proved in part (1) of Example 6.1.2. 
(3) follows from 


tim (CA h) = (cf)(h) _ 4, cha +h) = ef (h) 
ae h h—+0 h 
~ elim Sat = Fh) 
h-0 h 


= of (a); 
and (4) follows from the sum rule for limits and from 


(ft+g)(a+h)—(f +g)(h) 


me h 
soon LGR) + ola +h) = fh) = 9(h) 
h—0 h 
ven LGR) = Hh) + 9a +h) ~ g(h) 
h—0 h 
; flat+th)—f(h g(a th) —g(h 
= jim, (4 ) (A) of ) 2) 
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The following proves the product rule (5): 


fan (f9)(0+h)~(f-9)(a) _,, fla+h)-9(a +h) ~ Fla) g(a) 
h—0 h h-0 h 
flathjg(at+h) — flajgla +h) + flajg(a +h) — F(a)g(a) 
h-0 h 
(by addition of a clever 0) 
= jn, (Let = Sole + 1+ Figo) — fa) 
h-0 h 


(by re-writing) 
= jy (Het = 1 
= lin 
h-0 h 
= f'(a)g(a) + flag’), 
where in the last step we used that f and g are differentiable at a and that g is continuous 
at a (by Theorem 6.2.1). 


g(a +h) + f(a) 


gla +h) — g(@) 
i) 


The proof of the quotient rule (6) is similar. It is written out next without a com- 
mentary; the reader is encouraged to supply justification for each step: 


5 GOP 58) = Gem ae 
h—0 h h—0 h 
sam f+ Mala) — F(agla +h) 
h—0 hg(a + h)g(a) 
_ un Lat Wala) = flagla) + Fa)g(a) = lagla+h) 
h—0 hg(a+ h)g(a) 
— tim Let gla +h) — Flajgla + h) + fla)gla) — fla)gla + b) 
h—0 hg(a + h)g(a) 
Lie (= t+th)gia+h)—flagla+h) — flajglath)— ee 
h—+0 hg(a+ h)g(a) hg(a + h)g(a) 
2G PO ae) 
h—0 h g(a+ h)g(a) 
f(a) gla +h) — uo) 
gla + h)g(a) h 
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Theorem 6.2.3. (Power rule) If n is a positive integer, then (x")' = na"~!. 


Proof #1: Part (1) of Example 6.1.2 with m = 1 and 1 = 0 proves that (x1)! = 1, so that 
the theorem is true when n = 1. Now suppose that the theorem holds for some positive 
integer n. Then 


x”—1)'¢ +2"—1z' (by the product rule) 


n—1)a"-?2+a"—! (by induction assumption) 


gt gt 


so that the theorem holds also for n, and so by induction also for all positive n. 


Proof #2: The second proof uses binomial expansions as in Exercise 1.7.7: 
7 (c+h)"—«2 
h>0 h 
bao (p)eene* — 2” 
h>0 h 
n—-li(n n— 
donno (een * 


Theorem 6.2.4. (Polynomial, rational function rule for derivatives) Polynomial functions 
are differentiable at all real/complex numbers and rational functions are differentiable at 
all points in the domain. 


Proof. The proof is an application of the sum, scalar, and power rules from Theorems 6.2.2 
and 6.2.3. 
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Theorem 6.2.5. (The composite function rule for derivatives, aka the chain rule) 
Suppose that f is differentiable at a, that g is differentiable at f(a), and that a is a limit 
point of the domain of go f. (If f: A> B,g:B—C , and f is differentiable at a, then 
automatically a is a limit point of A and hence of the domain of go f.) Then go f is 
differentiable at a, and (go f)/(a) = g/(f(a))- f’(a). 


Proof. Let «€ > 0. Since f is differentiable at a, there exists 6; > 0 such that for alla+h 
in the domain of f, if 0 < |h| < 61 then | lath Fa) — f'(a)| < min{1, €/(2|g'(f(a))| + 2)}. 
For all such h, by the triangle inequality, | Hate fla) < |f’(a)| +1. By assumption g is 
differentiable at f(a), so that there exists 62 > 0 such that for all x in the domain of g, if 
0 < |a—f(a)| < 62, then ee ee —g'(f(a))| < €/(2|f’(a)|4+2). Since f is differentiable 
and hence continuous at a, there exists 63 > 0 such that for all x in the domain of f, if 
|x — al < 63, then |f(x) — f(a)| < 61. Set 6 = min{6,, J, 63}. Let a+ h be arbitrary in 
the domain of go f such that 0 < |h| < 6. In particular a + h is in the domain of f. If 


f(at+h) ¢ f(a), then 


ee Ner = GD _ gpa). $0 

[ao MD =F) _ yep). $0 
_|olfla+h))— 9 FQ) f+ M—F@) — egy ary 
- hee et s'(F(@))-F(0 
_|(o(Fla+h) —9F) — ryan) fate) — Fla) 
=|(S@emaney 9) 

+ of f(a)) HEED 1) — gi F(a) Fa 

(Fath) ~ 960) — yer) | £la+ A) — Fla) 

ao elo 

+ lo H(a))|] =A — pa) 


€ 
Sy A = es 
~ 2If"(a)| +2 
€ 

2 2 


= €. 


(f(@)| + 1) + I9°(F(@)) 


A 


Thus if there exists 6 as above but possibly smaller such that f(a+h) 4 f(a) for alla+h 
in the domain with 0 < |h| < 6, the above proves the theorem. 

Now suppose that for all 6 > 0 there exists h such that a+ h is in the domain of f, 
0 < |h| < 6, and f(a+h) = f(a). Then in particular when h varies over those infinitely 
many h, f’(a) = limp_4o Flath) fe) = limp+o ? = 0. Also, for such h, (go f)(a+h) —(go 
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f)(a) = g(f(a+h)) — g(f(a)) = 0, so that 


(go f)lat+h)— (ge f)@ 
h 


This analyzes all the cases and finishes the proof of the theorem. 


g'(f(a))- f'(a)| =0<e. 


Theorem 6.2.6. (Real and imaginary parts) Let A C R, and let a € A be a limit 
point of A. Let f : A—C. Then f is differentiable at a if and only if Re f and Im f are 
differentiable at a, and in that case, f’ = (Re f)' + i(Im f)’. 


Proof. Since all h are necessarily real, 


flath) — fla) _ Re(f(a+h) — fl@)) +ilm(f(a + h) — fla) 
h h 


= Re (LOI) 4 4 (HOH FO), 


and by the definition of limits of complex functions, the limit of the function on the left 


exists if and only if the limits of its real and imaginary parts exist. 


Compare this last theorem with Exercises 6.2.4 and 6.2.5. 

Exercises 5.2.2 and 5.2.3 each give an example of a continuous invertible function 
whose inverse is not continuous. It is also true that the inverse of a differentiable invertible 
function need not be differentiable. For example, the cubing function on R (f(x) = 2°) 
is differentiable and invertible, but its inverse, the cube root function, is not differentiable 
at 0, as the limit of qOeheeve = h—"/3 does not exist as h goes to 0. Nevertheless, we can 
connect differentiability of an invertible function to the derivative of the inverse, as shown 


in the next theorem. 


Theorem 6.2.7. (Derivatives of inverses) Let A,B C C, let A be open, and let f : 
A — B be an invertible differentiable function whose derivative is never 0. Then for all 
b¢ B, f~' is differentiable at b, and 


(f-°)'() = FG)’ 


or in other words, for all a € A, f~' is differentiable at f(a) and 


Proof. Let b € B. Then b = f(a) for some a € A. Since A is open, there exists r > 0 such 
that B(a,2r) C A. Let C be any closed and bounded subset of A that contains B(a,r). 


For example, C' could be {x € C: |x -—a| <r}. Define g: C > D= {f(x):a2€C} as 


1 


g(x) = f(x). Then g is invertible and continuous. By Theorem 5.2.4, g~* is continuous. 


—1 


In particular, g~* is continuous at b = f(a) = g(a). 
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Let « > 0. Since g is differentiable at a, and g(a) = f(a) = b and g'(a) = f'(a), by 
the limit definition of derivatives, there exists 6 > 0 such that for all x € C, if | —a| < 6 
then tach — f'(a)| < e. By continuity of g~!, there exists y > 0 such that for all 
y € D, if |y—b| < y then |g~1(y) — a] = |g~1(y) — g71(b)| < 6. In particular, for all 
such y, les _ f"(a) = Bese — f"(a) < e¢. This says that for all h € C, if 


g-*(y)-a g-*(y)-a 
Jn] < y and b+h € D, then enim — f"(a) — LC area f'(a)| < «. Thus 


limp_+0 ete = f'(a), and by the quotient rule for limits, 


ey Oho 0) ee oO nae A 1 
im = lim = = — 4 
h—0 h h—0 h f(a) f"(f-*(6)) 
But this holds for every C (and g which is f restricted to C), so that in particular, 
“(b+ h) — f-*(b) 1 
-—l1\ aT f ( a, : 
ae h FO) 


It should be noted that if we know that f~! is differentiable, the proof of the last 

part of the theorem above goes as follows. For all x € B, (f o f~!)(x) = 2, so that 
(f o f~1)'(x) = 1, and by the chain rule, f’(f~!(x)) - (f71)'(x) = 1. Then if f’ is never 0, 
we get that (f—')'(x) = PUM" 
Example 6.2.8. Let f : [0,00) — [0,00) be the function f(2) = x7. We know that f 
is differentiable at all points in the domain and that f’(#) = 2x. By Example 6.1.5 and 
Exercise 6.1.4, the inverse of f, namely the square root function, is differentiable at all 
positive x, but not at 0. The theorem above applies to positive x (but not to x = 0): 


1 1 1 


Theorem 6.2.9. Let n be a positive integer. Then for all non-zero x in the domain of \/ , 


il 
(x/xy' = LS gl/n-1 
n 


Proof. Let A = R* if n is even and let A = R \ {0} otherwise. Define f : A > A to 
be f(x) = x2”. We have proved that f is invertible and differentiable. The derivative is 
f'(x) =nx"—", which is never 0. Thus by the previous theorem, f~! = ~/ is differentiable 


a a 1 _ 1 ee ee ee el 
(Ve) = Fea) = nar nae = “aa 
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Theorem 6.2.10. (Generalized power rule) Let r be an arbitrary rational number 
and let f : Rt — R* be given by f(x) = 2". Then for all x, f'(x) =ra™~!. 


Proof. The power rule and quotient rules prove this in case r an integer, and the previous 
theorem proves it in case r is one over a positive integer. Now suppose that r = m/n for 
some integers m,n with n 4 0. Since m/n = (—m)/(—n) is also a quotient of two integers, 
we may write r = m/n so that m € Z and n is a positive integer. Thus f is the composition 
of exponentiation by m and by 1/n. By the chain rule, 
f'(a) ee m(a/rym-t . 1 ajn-t 
n 
= DY a1) oF t/a 1 
n 

= ry/n—1/nt+1/n-1 


=rg"}, 


This proves the theorem for all rational r. 


The theorem also holds for all real r. But to prove it for all real r one first needs to 
define exponentiation by non-rational numbers. Such exponentiation was worked through 
laboriously in Exercises 5.4.7 and 5.4.8, and if we were to continue that kind of labori- 
ous treatment, the proof of the form of the derivative of such exponentiation would also 
be laborious. So we postpone the definition of such exponentiation and the proof of its 
derivative to Theorem 7.6.4, where with the help of integrals the definition and proofs write 
themselves elegantly. 


Exercises for Section 6.2 
6.2.1. Provide the commentary for the proof of the quotient rule in Theorem 6.2.2. 


6.2.2. Let f,g,h be differentiable. Prove that (fgh)’ = f’gh+fg'h+fgh'. More generally, 
prove a product rule formula for the derivative of the product fi - fo----- Ta 


6.2.3. Prove yet another form of the general power rule for derivatives: If f is 
differentiable at a, then for every positive integer n, f” is differentiable at a, and (f”)/(a) = 
n(f(a))"—! f’(a). (Recall that f” is the composition of n copies of f, see Remark 2.4.8.) 


6.2.4. Prove that the functions Re, Im: C > R are not differentiable at any a. 


6.2.5. (Compare with Theorem 6.2.6.) Prove that the absolute value function on R is 


differentiable at all non-zero a € R. Prove that the absolute value function on C is not 
differentiable at any non-zero a € C. (Hint: Let h = (r —1)a for r near 1.) 
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6.2.6. The following information is known: c f(c) f’(c) g(c) g’(c) 
Ge. uh 2 6 4 
Ll 0 5 3 
2 we —3 6 —6 
3. «A 2 3 5 


4 0 8 4 t 
For each of the following, either provide the derivative or argue that there is not enough 
information. In any case, justify every answer. 
i) (f -—9)'Q) = 

ii) (f-9)'Q) = 
# (£) (3) _ 

iv) (gof)' (4) = 
6.2.7. A function f is differentiable on (—2,5) and f(3) = 4, f’(3) = —1. Let g(x) = 32. 
For each of the statements below determine whether it is true, false, or if there is not 


) 
iii) 
) 


enough information. Explain your reasoning. 
i) f is constant. 

ii) The slope of the tangent line to the graph of f at 3 is 4. 

iii) f is continuous on (—2,5). 

) 

) 


iv) The derivative of (fog) at 1 is —3. 


v) (f +9)'(3) =2. 
6.2.8. Let f(x) = 4. 
i) Find all a in the domain of f such that the tangent line to the graph of f at a has 
slope —1. 
ii) Find all a in the domain of f such that the tangent line to the graph of f at a has 
slope 2. (Solutions need not be real.) 


6.3. The Mean value theorem 


In this section the domains and codomains of all functions are subsets of R. 


Theorem 6.3.1. Let f : [a,b] — R, and let c € [a,b] such that f achieves an extreme 
value at c (i.e., either for all x € [a,b], f(c) < f(a) or for all x € [a,b], f(c) > f(x)). Then 
at least one of the following holds: 

(1) c=a; 

(2) c=); 

(3) f is not continuous at c; 

(4) f is not differentiable at c; 


226 Chapter 6: Differentiation 


(5) f is differentiable at c and f'(c) = 0. 


Proof. It suffices to prove that if the first four conditions do not hold, then the fifth one 
has to hold. So we assume that c 4 a, c # b, and that f is differentiable at c. 

Suppose that f’(c) > 0. By the definition of derivative, f’(c) = lim,.. fa) fe) 
Thus for all x very near c but larger than c, fe) Ke) > 0, so that f(x) — f(c) > 0, so 
that f does not achieve its maximum at c. Also, for all x very near c but smaller than c, 
fe) fle) > 0, so that f(x) — f(c) < 0, so that f does not achieve its minimum at c. This 
is a contradiction, so that f’(c) cannot be positive. Similarly, f’(c) cannot be negative. 
Thus f’(c) = 0. 


Thus to find extreme values of a function, one only has to check if extreme values 
occur at the endpoints of the domain, at points where the function is not continuous or 
non-differentiable, or where it is differentiable and the derivative is 0. One should be aware 
that just because any of the five conditions is satisfied, we need not have an extreme value 
of the function. Here are some examples: 


(1) The function f : [—1,1] — R given by f(x) = x? — x has neither the maximum 


nor the minimum at the endpoints. 


: an 1a 0: ; : 
(2) Let f : [-1,1] > R be given by f(x) = { 1/2, if <0. Then f is not continuous 


at O but f does not have a minimum or maximum at 0. 


: fe SO: 
(3) Let f : [-1,1] > R be given by f(x) = ee ifr <0. 


and not differentiable at 0, yet f does not have a minimum or maximum at 0. 
(4) Let f : [-1,1] > R be given by f(x) = x?. Then f is differentiable, f’(0) = 0, but 


f does not have a minimum or maximum at 0. 


Then f is continuous 


Theorem 6.3.2. (Darboux’s theorem) Let a < b be real numbers, and let f : [a,b] > 
R be differentiable. Then f' has the intermediate value property, i.e., for all k between 
f'(a) and f'(b) there exists c € [a,b] such that f’(c) =k. 


Proof. If f'(a) = k, we set c = a, and similarly if f’(b) = k, we set c = b. So we may 
assume that k is strictly between f’(a) and f’(b). 

The function g : [a,b] > R given by g(x) = f(x) — ka is differentiable, hence 
continuous. Note that g'(x) = f’(x) — k, so that 0 is strictly between g/(a) and g’(b). 
If g(a) > 0, let c € [a,b] such that g achieves a maximum at c, and if g'(a) < 0, let 
c € [a,b] such that g achieves a minimum at c. Such c exists by the Extreme value theorem 
(Theorem 5.2.2). Note that g'(a) < 0 if and only if f’(a) < k, which holds if and only 
if f’(b) > k, which in turn holds if and only if g’/(b) > 0. Similarly g’(a) > 0 if and 
only if g'(b) < 0. Thus for both choices of c, c cannot be a or b, so that c € (a,b). By 
Theorem 6.3.1, g'(c) = 0. Hence f’(c) = k. 
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Theorem 6.3.3. (Rolle’s theorem) Let a,b € R with a < b, and let f : [a,b] ~ R bea 
continuous function such that f is differentiable on (a,b). If f(a) = f(b), then there exists 
c € (a,b) such that f’(c) = 0. 


Proof. By the Extreme value theorem (Theorem 5.2.2) there exist 1,u € [a,b] such that 
f achieves its minimum at / and its maximum at u. If f(J) = f(u), then the minimum 
value of f is the same as the maximum value of f, so that f is a constant function, and so 
f'(c) = 0 for all c € (a,b). 

Thus we may assume that f(/) 4 f(u). It may be that f achieves its minimum at 
the two endpoints, in which case u must be strictly between a and b. Similarly, it may be 
that f achieves its maximum at the two endpoints, in which case / must be strictly between 
a and b. In all cases of f(l) 4 f(u), either / or u is not the endpoint. 

Say that J is not the endpoint. Then a <1 < b. For all x € [a,b], f(x) > f(d), 
so that in particular for all x € (a,l), fe) i@ < 0 and for all x € (1,5), fa) FO > 0. 
Since:.f'(1) = lintg5; fe) FO exists, it must be both non-negative and non-positive, so 
necessarily it has to be 0. 

If instead u is not the endpoint, then a < u < b and for all x € [a,b], f(x) < f(u). 
Thus in particular for all x € (a,u), Fe) Fu) > 0 and for all x € (u,b), f(a) flu) <0. 
f(«)—f(u) 


Since: f’ (a). = limg-, exists, it must be both non-negative and non-positive, so 


necessarily it has to be 0. Thus in all cases we found c € (a,b) such that f’(c) = 0. 


Theorem 6.3.4. (Mean value theorem) Let a,b € R with a < b, and let f : [a,b] > R 
be a continuous function such that f is differentiable on (a,b). Then there exists c € (a, b) 
such that f'(c) = £O)- fa) 


Here is an illustration of this theorem: the slope fo) Ka) of the line from (a, f(a)) 
to (b, f(b)) also equals the slope of the tangent line to the graph at some c between a and b: 


Y 


Proof. Let g : [a,b] + R be defined by g(x) = f(x) LO) fla) (x —a). By the sum and 


scalar rules for continuity and differentiability, g is continuous on [a,b] and differentiable 


on (a,b). Also, g(a) = f(a) and g(b) = f(b) — L4-£ (6 — a) = f(b) — (f(@) - f(a) 
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= f(a) = g(a). Thus by Rolle’s theorem, there exists c € (a,b) such that g’/(c) = 0. But 


a 


so that 0 = g/(c) = f’(c) — £0) Ha) whence f’(c) = ae 


The rest of this section consists of various applications of the Mean value theorem. 
More concrete examples are left for the exercises. 


Theorem 6.3.5. Let a,b € R with a < b, and let f : [a,b] — R be a continuous function 
such that f is differentiable on (a, b). 

(1) If f’(c) => 0 for all c € (a,b), then f is non-decreasing on {a, }}. 

(2) If f’(c) > 0 for all c € (a, 


( 

( , then f is strictly increasing on [a,b]. 
(3) If f'(c) <0 for all c € (a, 

( 

( 


c 
C , then f is non-increasing on [a, }]. 
(4) If f'(c) <0 for allce€ 


(5) If f’(c) =0 for all ce 


a,b), then f is strictly decreasing on |a, }]. 


ee pee Lie, a eee 
a Ss NR Ne 


C a,b), then f is a constant function. 


Proof of part (2): Let x,y € [a,b] with x < y. By Theorem 6.3.4 there exists c € (z, y) 
such that f’(c) = Tey) Since f’(c) > 0 and x < y, necessarily f(x) < f(y). Since x 
and y were arbitrary with x < y, then f is strictly increasing on |[a, 0]. 


Example 6.3.6. For all x > 1, 4x? > 2x + 2. 


Proof. Let f(z) = 4x23 — 2a — 2. Then f is differentiable and f’(z) = 12x2 — 2x = 
2x(6xz — 1) > 0 for all x > 1. By the previous theorem, f is strictly increasing on [1, 00), 
so that for all x > 1, 4x3 — 22 —2 = f(x) > f(1) =0 and so 4x? > 24 +2. 


Theorem 6.3.7. (Convexity of graphs) Let f be a function that is continuous on |a, }] 
and twice-differentiable on (a,b). 
(1) If f’” > 0 on (a,b), then the graph of the function lies below the line connecting 


(a, f(a)) and (0, f(b). 
(2) If f’ <0 on (a,b), then the graph of the function lies above the line connecting 
( 


(a, f(a)) and (6, f(b). 


Proof. The equation of the line is y = LOM) (gy _ a) + f(a). 

(i) Let x € (a,b). We need to prove that f(x) < FO) HO) —a)+ f(a). By 
Theorem 6.3.4 there exists c € (a,x) such that f’(c) = fe) Ka) and there exists d € (a, x) 
such that f’(d) = £O) f@) Since f” > 0 and c < x < d, necessarily He) Ka) = LO) fe) 


—Zx 


By cross-multiplying by positive (x — a)(b — x) we get that 
(f(x) — f(a))(b— 2) < (f(b) — f(x)) (x — a), 
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or in other words, that f(x)(b— a) < f(a)(b— 2) + f(b)(@ — a) = f(a)(b— a) + (f(b) - 
f(a))(a — a), and by dividing by positive b — a yields the desired inequality. 


The proof of (ii) is similar. 


Theorem 6.3.8. (Cauchy’s mean value theorem) Let a < b be real numbers and let 


f,g : [a,b] > R be continuous functions that are differentiable on (a,b). Then there exists 
€ (a,b) such that 


In particular, if g'(c) 4 0 and g(b) 4 g(a), this says that a = Pamae 


Proof. Define h : [a,b] + R by h(x) = f(x)(g(b) — g(a)) — g(x) (f(b) — f(a)). Then h is con- 
tinuous on |a, b] and differentiable on (a,b). Note that h(a) = f(a)(g(b)—g(a)) —g(a)(f(b)— 
f(a)) = f(a)g(b) — g(a) f(b) = h(b). Then by the Mean value theorem (Theorem 6.3.4) 
there exists c € (a,b) such that h’(c) = 0, ie., 0 = f’(c)(g(b) — g(a)) — g'(c)(f(b) — f(a)). 


Theorem 6.3.9. (Cauchy’s mean value theorem, II) Let a < b be real numbers and 


let f,g : [a,b] + R be continuous functions that are differentiable on (a,b) and such that 


g' is non-zero on (a,b). Then g(b) 4 g(a), and there exists c € (a,b) such that 


fo) _ f) — fa) 

g'(c) — g(b) — g(a) 
Proof. By the Mean value theorem (Theorem 6.3.4) there exists c € (a,b) such that 
ote). = a0) 9a) | By assumption, g'(c) # 0, so that g(b) 4 g(a). The rest follows by 


b-—a 


Cauchy’s mean value theorem (Theorem 6.3.8). 


Exercises for Section 6.3 


6.3.1. Let f : R— R be a polynomial function of degree n. 


i) If f has m distinct roots, prove that f’ has at least m — 1 distinct roots. 
ii) Suppose that f has n distinct roots. Prove that f’ has exactly n—1 distinct roots. 
(You may need Theorem 2.4.15.) 
iii) Give an example of f with n = 2, where f has no roots but f’ has 1 root. Why 
does this not contradict the first part? 
6.3.2. Let f : [0,1] — C be given by f(x) = 2? +ix?. Prove that there exists no c between 
0 and 1 such that f’(c) = £10) | Does this contradict the Mean value theorem? 
6.3.3. Let f : [a,b] > C be continuous on [a,b] and differentiable on (a,b). Prove that 
there exist c,d € (a,b) such that Re(f’(c)) = Re(L@=fe)) and Im(f’(d)) = Im(f@EL@) | 
Give an example showing that c may not equal to d. Does this contradict the Mean value 
theorem? 
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6.3.4. Prove the remaining parts of Theorem 6.3.5. 


6.3.5. This water-filled urn is getting drained at a constant rate through a hole at the 
bottom. Plot the function of depth (d) as a function of time. 


6.3.6. Let A be an open interval in R, and let f : A > R be differentiable such that the 
range of the derivative function is bounded. Prove that f is uniformly continuous. 


6.4 L’Hopital’s rule 


For L’Hopital’s rule we pass from f to f. it is worth reviewing that the quotient 


(£) - f'g — fo! 
g ge 


Theorem 6.4.1. (L’H6pital’s rule, easiest version) Let A C C, let a € A be a limit 
point of A, and f,g: A— C such that 

(1) f(a) = g(a) =0. 

(2) f and g are differentiable at a. 

(3) g'(a) #0. 


rule for derivatives is different: 


Fey) 
ae aa 


/ 
a= F(a) (by the quotient rule for limits since g'(a) 4 0). 


g'(a) 


The rest of the versions of L’Hopital’s rule in this section only work for real-valued 


functions on domains that are subsets of IR because the proofs invoke the Cauchy’s mean 
value theorem (Theorem 6.3.8). 
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Theorem 6.4.2. (L’H6pital’s rule) Let a,L € R and let f,g be real-valued functions 
defined on an interval containing a such that 

(1) f(a) = g(a) =0. 

(2) f and g are differentiable near a but not necessarily at a. 

(3) g' is non-zero near a. (This is a condition for x near a but not equal to a.) 

/ 
(i tig eG, 
ra g(x) 


Then lim Ae 
za g x) 


Proof. Let « > 0. Since lim 1 £2 = L, there exists 6 > 0 such that for all x in the domain, if 


0 < |a—a| < 6 then | Ha)” L| < «. Let x be one such number. By Theorem 6.3.9 there exists 


a number ¢ strictly between a and x such that £9 = f-S@ | gince fia) = gla) =0; 


g'(c) ~~ g(«)-g(a)* 
this says that £ ig = aoe Since 0 < |x —a| < 6 and c is between a and 2, it follows that 
0 < |c—a| < 6. Hence 
/ 
(Om oe ee 
g(x) g'(e) 


With our definition of derivatives, this last version includes the one-sided cases where 
the domains for f and g are of the form [a,b), or of the form (6, aj. 

I should note that some books omit hypothesis (3). A counterexample to the con- 
clusion if we omit this hypothesis but keep all the others is in Exercise 10.4.7. 

The versions of L’H6pital’s rule so far deal with limits of the form “zero over zero”. 
There are versions for the form “infinity over infinity”; I write and prove only the right-sided 
version, but similarly there is a left-sided version and a both-sided version. 


Theorem 6.4.3. (L’H6pital’s rule) Let L and a < b be real numbers. Let f,g : (a,b) > 
R be differentiable with the following properties: 
(1) lim f(x) = lim g(t) =o. 
x—b- x—b- 
(2) For all x € (a,b), g'(x) £0. 
Me 


File) _ 
is sane g(x) 
Then lim fa) =L 
a b- g(x) 


Proof. Let « > 0. By assumption there exists 6; > 0 such that for all x € (a,b), if 
0<b—«a < 61, then oe — L| < €/4. By possibly replacing 6; by min{6é1, (b — a)/2} we 
may assume that b — 6; > a. 

Set ag = b— 6). Fix x such that 0 < b—a@ < 6;. Then x € (ao,b) 
Suppose that g(x) = g(ao). Then by the Mean value theorem (Theorem 6.3.4) there exists 
u € (ao, 2) such that g/(u) = 0, which contradicts the assumption (2). Thus g(x) 4 g(ao). 


€ s(asb): 
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By Theorem 6.3.9 there exists c € (a9,x) such that re = Hae ee and so 


oe / 
He) = Hoo) _j) JL _ a) can 
g(x) — g(ao) g'(¢) 
Since lim {z= lim g(x) = ov, there exists 62 > 0 such that for all « with 
a—b- xr—>b- 
0<b—«2 < 62, f(x) and g(x) are non-zero, and so we can define h : (b — 62,6) > Ras 

{= f (ao) 

eee (CE 

hae) = 1 alae)” 
g(x) 


By Theorem 4.5.7, lim,_,,- h(x) = i= = 1. Thus there exists 63 > 0 such that for all x, 
if 0 <b—2 < 6s, then |h(x) — 1| < min{e/4(|Z| +1), 5}. The (4)-restriction in particular 
means that h(x) > 4 and thus non-zero. Set 6 = min{d,, 62,63}. Then for all x with 


0<b-2< 0, 


f(x) | | (x) 1 
——~ — [| = |——A(z) — LA(z) 
g(@) g(2) |n(a)| 
= 1 
g(2) — g(ao) |n(a)| 
f(x) — f(ao) 
<2 Lh(x since |h(x)| > 1/2 
a Fe) — pn(a)| (since |a(e)| > 1/2) 
= 2 fo— fe) LE L— tile) 
g(#) — g(ao) 
(by addition of a clever zero) 
g(x) — g(ao) 
(by the triangle inequality) 
f'() ) 
=o — LDL) + |L|\1— hile 
(|55S - 2) + 1211 - ace) 
€  € 
<2(7 +3) 
=€. 
. ye 
Example 6.4.4. Compute iim ae 
Proof #1: By Example 1.6.4, =+ = <“72¢*9., so that by Exercise 4.4.8 and Theo- 
. yee . x 
rem 4.4.6, im ay = im ea - eas = §. 


Proof #2: By Theorem 4.4.5, lim (x? — 1) = 0 = lim(a? — 1), lim(x? — 1)’ = lim 22 = 2, 
x1 x1 x1 x1 


a2 


and lim(x® — 1)’ = lim 32? = 3. Thus by L’H6pital’s rule (Theorem 6.4.2), lim 4+ 
zl xl a1 * 


equals 2 : 
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More forms of L’H6pital’s rule are in the exercises below and also in Sections 7.6 and 
10.4 after the exponential and trigonometric functions have been introduced. 


Exercises for Section 6.4 


6.4.1. Compute and justify fm . va 


6.4.2. What is wrong with the following: Since lim, +2 %# = 2, by L’Hopital’s rule we 
conclude that lim,_,2 43 = 2. 


6.4.3. Let n be a positive integer and a € C. Use L’Hopital’s rule to prove that 


nm 
; on 
lims+a Z—3 


6.4.4. (Here the goal is to prove another version of L’H6pital’s rule) Let f,g : (a,co) ~ R 
be differentiable. Suppose that lim fee lim g(x) = oo, that g'(x) is non-zero for all x, 


~=na"!, (We have proved the case of a = 1 previously, say how.) 


and that im ie } = = L for some L € R. 
i) tee € > 0. Prove that there exists N > a such that for all > N, |£ ris} —L|<e. 
ii) Prove that there exists N’ > N such that for all x > N’, f(x), g(x) > 0. 
iii) Prove that there exists N” > N’ such that for all > N”, f(x) > f(N’) and 
g(x) > g(N"). 
iv) Prove that for all xz > N” there exists c € (N’, x) such that PAG) as FAN) 


g'(c) ~~ g(@) 1-g(N")/g(a) ° 
v) Prove that Jim f ne =D: 


6.4.5. (Yet euler version of L’H6pital’s rule) Let f,g : (a,oo) — R be differentiable. 
Suppose that Jim (a= Jim g(x) = 0, that g/(x) is non-zero for all x, and that lim 


LOCO 
fay — 7, Bide that Jim “1) _ — we 
g(x) g(x) 


6.5 Higher-order derivatives, Taylor polynomials 


Let f be a function from a subset of C to C. 

If f is continuous at a, then near a, f is approximately the constant function f(a) 
because lim,-,_ f(x) = f(a). Among all constant functions, the function y = f(a) approx- 
imates f at a best. 

If f is differentiable near a and the derivative is continuous at a, then for all x near a 
but not equal toa, f(x) = f(«)—f(a)+f(a) = P=L% (w@—a)+ f(a) = f'(a)(e«—a) + f(a), 
so that for x near a (and possibly equal to a), f is approximately the linear function 
f'(a)(a — a) + f(a), ie., f is approximated by its tangent line. 

This game keeps going, but for this we need higher order derivatives: 
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Definition 6.5.1. Let f be differentiable. If f’ is differentiable, we write the derivative 
of f' as f", or as f). If f-) is differentiable, we denote its derivative as f‘™. We say 
that f has the nth derivative, or that it has derivatives up to order n. 

Using this notation we also write f“) = f’ and f = f. 


If f(x) = 2™, then for all n < m, f(x) = m(m — 1)(m — 2)---(m—n+1)a™-" 


Definition 6.5.2. Let f be a function with derivatives of orders 1,2,...,n existing at a 
point a in the domain. The Taylor polynomial of f (centered) at a of order n is 


f"(@) pee a (a) 


2! n! 


n 


Tn,fa(v) = f(a) + f"(a)(@ — a) + (x —a)?+- 


(x — a) 


k=0 
Example 6.5.3. If f(x) = «4 — 3x? + 427 + 7x — 10, then with a = 0, 
To, f,0(2) = 
Ti, ¢.0(@) = —10 +72, 
To, 5,0(x) = —10 +72 + 42”, 
(Cae 10 Te de? Se". 
(x) = -10 + 7x + 42? — 32° + 2° for all n > 4, 


T3, ¢,0(x 
eran 


and for a = 1, 


To, f.1(@) = 

Ti,¢1(%) = -14 10(2 — 1), 

To, ¢.1(x) = -14+10(2 —1) + (x — 1)?, 

T3 f1(@) = —1+10(% —1)+ (« —1)? +(# — 1), 

Ty,g.1(@) = —1+10(@ —1) + @—-1)? + (e—-1)* + (2 —1)* for alln > 4. 


Note that for all n > 4, Th ¢o(x) = Tn, p(x) = f(a). 
The following is a generalization of this observation: 


Theorem 6.5.4. If f is a polynomial of degree at most d, then for any a € C and any 
integer n > d, the nth-order Taylor polynomial of f centered at a equals f. 


Proof. Write f(x) = cg +e,2+---+cgx% for some cp, C1,...,Cq € C. By elementary algebra, 
it is possible to rewrite f in the form f(x) = eg + e1(a — a) + e2(@ — a)? +--+ + eq(x —a)4 
for some €9, €1,...,€q € C. Now observe that 


f(a) = eo, 
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f'(@) = a1, 

f" (a) = 2e2, 

f’" (a) = 6e3 = Bles, 
f (a) = 24e4 = Alea, 


f(a) =O0ifk>d. 


But then for n > d, 


Theorem 6.5.5. (Taylor’s remainder theorem over R) Let I be an interval in R, 
and let (a—r,a+r) = B(a,r) C I. Suppose that f : I > R has derivatives of orders 
1,2,...,n +1 0n B(a,r). Then for all x € B(a,r) there exist c,d between a and x such 
that 


(n+1) Cc 
(0) = Tn gala) + De ay(e— 0)", 
fora) n+1 
(n+ 1)! ay: 


fa) = Thala 


Proof. Let g: B(a,r) + R be defined by 


ne £(k) 
at) = OM eH, 


k=0 


Then g is differentiable on B(a,r), and 


My fCRFD ye (k) 
g(t) = Ss. f - (t) (x i)" ? kf 7 (t) (x ie 

k=0 , k=0 : 

(n+1) et eee) Me F(R) 
-_ f a (t) (x ty? Ss > f 7 (t) (x ae d ile pe 
=0 k=1 

(n+1) (4 

eeanlOim, 


Note that g(a) = f(x), g(a) = Tn, p.a(x). By the Mean value theorem (Theorem 6.3.4), 
there exists c strictly between a and x such that g’(c)(a—a) = g(x) — g(a). In other words, 
rPrO ( —c)"(x — a) = f(x) — Tn, ,a(2), which proves the first formulation. 

By Cauchy’s mean value theorem (Theorem 6.3.8) applied to functions g(t) and 
h(t) = (x—t)"*!, there exists d between x and a such that h’(d)(g(x)—g(a)) = g’(d)(h(x) — 
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h(a)). In other words, 


= = n = > fort ( ) n -_ = n+1 
(n + 1)(x — d)"(F(@) — Th p,a()) = —— (# — a)"(0— (@ — ah"). 


(n4+1 
Since d is not equal to x, we get that —(n+1)(f(x) — Th f,a(x)) = sits a) ( (x —a)"*?), 


n! 


which proves the second formulation. 


Theorem 6.5.6. (Taylor’s remainder theorem over C) Let A be a subset of C or 
of R, let a € A, and let r € R* such that B(a,r) C A. Let f : A > C have higher order 
derivatives of orders 1,2,...,n on B(a,r) with n > 1. Then for every « > 0 there exists 
6 > 0 such that if x € B(a,6), then 


|f(%) — Lnfalt)| <€. 


Proof. Let € > 0. Let M = 1+max{|f(a)|,|f/(a)|,...,| f° (a)|}. Since f is differentiable 
at a, it is continuous at a, so there exists 6, > 0 such that for all x € B(a,r), if |r—a| < 61, 


then | f(x)—f(a)| < €/(n+1). Set 6 = min{r, 61, €/(M(n+4+1)) aaa ICES EE 


Let «x satisfy |x — a] < 6. Then zx is in the domain of f and T;,,¢,2, and 
|f(2)-Tn,p,a(2)| 


"a (n)(q 
= |#(@)— #0) — fF @)(@ — a) - E99 (@ — ay? —... 19 ean 
” (n)(q 
< Ife) FoI + 14") fea) +] | fo — a4} EI a — a 
Set Mae Mee AMD 


= €. 


More on Taylor polynomials and Taylor series is in the exercises in this section, in 
Exercise 7.4.9, and in Section 9.3. 


Exercises for Section 6.5 


6.5.1. (The second derivative test) Let f : [a,b] — R be differentiable on (a,b) and 
suppose that f’ is differentiable on (a,b) as well. Suppose that f’(c) = 0 for some c € (a,b). 
i) Suppose that f’(c) > 0. Prove that f achieves a minimum at c. 
ii) Suppose that f’’(c) < 0. Prove that f achieves a maximum at c. 
iii) Suppose that f’(c) = 0. By examples show that any of the following are possible: 
a) f achieves a minimum at c, b) f achieves a maximum at c, c) f does not have 


an extreme point at c. 
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6.5.2. Prove that if f is a polynomial function, then for every a € R, Ty 5.4 = f for all n 
greater than or equal to the degree of f. 


6.5.3. (Generalized product rule.) Suppose that f and g have derivatives of orders 1,...,” 


at a. Prove that 


Fa) => (PY Fag"), 


k=0 


6.5.4. (Generalized quotient rule.) Prove the following generalization of the product rule: 
Suppose that f and g have derivatives of orders 1,..., at a and that g(a) 4 0. Find and 
prove a formula for the nth derivative of the function f/g. 


6.5.5. Compute the Taylor polynomial of f(z) = “1+. of degree 5 centered at a = 0. 
Justify your work. 


6.5.6. Compute the Taylor polynomial of f(x) = 1 —< of degree 10 centered at a = 0. 
Justify your work. 


6.5.7. Let f(z) = kh. 
i) Compute f(x) for all integers k > 0. 


) 

ii) Compute the Taylor polynomial of f of arbitrary degree n and centered at a = 0. 
iii) Compute T,,,7,9(0.5). (Hint: Use Example 1.6.4.) 

iv) Compute T,,,¢,0(0.5) — f(0.5). 

v) Compute n such that |T;,,¢9(0.5) — f(0.5)| < 0.001. 

vi) Try to use Theorem 6.5.5 to determine n such that |T;,,¢,0(0.5) — f(0.5)| < 0.001. 
Note that this usage is not fruitful. 


vii) Use Theorem 6.5.5 to determine n such that |T),, ¢,0(0.4) — f(0-4)| < 0.001. 


6.5.8. Let f :C-—R be the absolute value function. 
i) Prove that f has derivatives of all orders at all non-zero numbers. 


ii) Compute T> +. 
6.5.9. Let f :R—R be given by f(z) = |2|®. 
i) Prove that f is differentiable, and compute f’. 


ii) Prove that f’ is differentiable, and compute f”. 
iii) Prove that f” is not differentiable. 


6.5.10. Find a differentiable function f : R— R such that f’ is not differentiable. 


6.5.11. Find a function f : R — R that has derivatives of orders 1, 2,3,4, but such that 
f is not differentiable. 


Chapter 7: Integration 


The basic motivation for integration is computing areas of regions bounded by graphs 
of functions. In this chapter we develop the theory of integration for functions whose 


domains are subsets of R. The first two sections handle only codomains in R, and at the 


end of Section 7.4 we extend integration to functions with codomains in C. We do not 
extend to domains being subsets of C as that would require multi-variable methods and 
complex analysis, which are not the subject of this course. 


7.1 Approximating areas 


In this section, domains and codomains of all functions are subsets of R. Thus we 
can draw the regions and build the geometric intuition together with the formalism. 

Let f : [a,b] > R. The basic aim is to compute the signed area of the region 
bounded by the x-axis, the graph of y = f(x), and the lines x = a and x = b. By “signed” 
area we mean that we add up the areas of the regions above the x-axis and subtract the 
areas of the regions below the z-axis. Thus a signed area may be positive, negative or zero. 


In the plot above, there are many (eight) regions whose boundaries are some of the 
listed curves, but only the shaded region (comprising two of the eight regions in the count) 
is bounded as a subset of the plane. 

The simplest case of an area is when f is a constant function with constant value c. 
Then the signed area is c- (b — a), which is positive if c > 0 and non-positive if c < 0. 

For a general f, we can try to approximate the area by rectangles, such as in the 
following approximations with crosshatched rectangles: 


ISL 
Le 
b 


KOT 
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It may be hard to decide how close the approximation is to the true value. But we 
can approximate the region more systematically, by having heights of the rectangles be 
either the least possible height or the largest possible height, as below: 


SSF 


Kx 
EO EAOSe, 


ae 
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—L RIL 
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Then clearly the true area is larger than the sum of the areas of the darker rectangles 


on the left and smaller than the sum of the areas of the darker rectangles on the right. 
We establish some notation for all this. 


Definition 7.1.1. A function f : A > C is bounded if the range {f(x): ax € A} of f is 


a bounded subset of C. 


Remark 7.1.2. If f is a bounded function with codomain R, then by the Least upper 
bound theorem (Theorem 3.8.5), for any subset B of the domain, sup{f(x) : x € B} and 


inf{ f(a) : « € B} are real numbers. 


Definition 7.1.3. A partition of [a,b] is a finite subset of [a,b] that contains a and b. 


We typically write a partition in the form P = {xo, “1,. 


“+9 < En-1 < fy = O. 


The sub-intervals of the partition P are [xo, x4], [71, £9], . 


oie | Bee eis 


..,%n}, where %p =a< 21 < 49 < 


In all illustrated examples above, the partition of [a,b] uses n = 10. Note that the 
sub-interval [71,25] of the interval |a, b| is not a sub-interval of the partition! 
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Definition 7.1.4. Let P = {x0,x1,...,%} be a partition of |a,b] and let f : [a,b] > R 
be a bounded function. The lower sum of f with respect to P is 


n 


Li P= S - inf { f(z) :2@ © [@p_1, &e] } (Ue — Le_1). 


k=1 


The upper sum of f with respect to P is 
U(f,P) = Sosupt f(x) +x € [ex—1, te] Hee — 24-1). 
k=1 


By Remark 7.1.2, L(f, P) and U(f, P) are real numbers. 

Clearly if f is a constant function f(x) = c for all x, then all lower and all upper 
sums are c(b — a), so that every lower and every upper sum equals c(b — a). If instead f 
is a non-constant function, then for every partition P there exists at least one sub-interval 
of P on which the supremum of the values of f is strictly bigger than the infimum of such 
values, so that L(f, P) < U(f, P). 

By the geometric set-up for all partitions P of [a, }], 


L(f,P) < the signed area < U(f, P). (7.1.5) 
In particular, if U(f, P) — L(f, P) < ¢, then either U(f, P) or L(f, P) serves as an approx- 


imation of the true signed area within ¢ of its true value. For most functions a numerical 


approximation is the best we can hope for. 


Theorem 7.1.6. If f(x) =c for all x € |a, |, then for any partition P of |a,b], L(f, P) = 
U(f, P) =c(b—a). 


Proof. Let P = {xo,...,%n} be a partition of [a,b]. For every k € 1,...,n, 


inf { f (ays ee [ep 15- ee] Supt i (a) fee laps ee | LS 


so that 
L(f,P) =U(f,P) = > cw — 2x-1) =) pai (Ck — Ce-1) 
k=1 
= C(¥n — Zo) = c(b—a). 


Example 7.1.7. Approximate the area under the curve y = f(x) = 30x4 + 2x between 
x =land az =4. We first establish a partition P, = {2o0,...,%n} of [1,4] into n equal 
sub-intervals. The length of each sub-interval is (4 —1)/n, and xg = 1, so that 7; = 
to + 3/n = 14 3/n, vo = 41 +3/n = 14+ 2-3/n, and in general, z, = 1+k-3/n. 
Note that rz, = 1+n-3/n = 4, as needed. Since f’(x) = 12x? + 2 is positive on [1,4], 
it follows that f is increasing on [1,4]. Thus necessarily for each 7, the infimum of all 


Section 7.1: Approximating areas 241 


values of f on the ith sub-interval is achieved at the left endpoint, and the supremum at 
the right endpoint. In symbols, this says that inf{f(a) : a € [w,_-1,2%|} = f(a,—-1) and 
sup{ f(x): a © [ap_1, 2%] } = f(ax). For example, with n = 1, L(f, Pi) = f(1)-3 = 96 and 
U(f, P.) = f(4)-3 = 3-(30-44+2-4) = 23064. Thus the true area is some number between 
96 and 23064. Admittedly, this is not much information. The problem is that our partition 
is too rough. A computer program produced the following better numerical approximations 
for lower and upper sums with respect to partitions P, into n equal sub-intervals: 


nr L(f;Pa) OS En) 
10 5061.2757 7358.0757 


100 6038.727 6268.407 
1000 6141.5217 6164.48967 
10000 6151.8517 6154.148 
100000 6152.885 6153.1148 
1000000 6152.988 6153.011 


Notice how the lower sums get larger and the upper sums get smaller as we take finer 
partitions. We would like to conclude that the true area is between 6152.988 and 6153.011. 
This is getting closer but may still be insufficient precision. For more precision, partitions 
would have to get even finer, but the calculations slow down too. 


The observed monotonicity is not a coincidence: 


Theorem 7.1.8. Let P,R be partitions of |a,b| such that P C R. (Then R is called a 
refinement of P, and R is said to be a finer partition than P.) Then 


Lf,P) SLR), USP) 2 USER). 


Proof. Write P = {x0,%1,...,%n}. Let i € {1,...,n}. Let RM [xp_1, 24] = {yo = 
Lke—-1)Y1;--+)Ym—1, Ym = Le}. By set inclusion, inf{f(x) : x € [yj;-1, yj|} = inf{f(z): a2 € 
[ies Wiel ts so that 

Lf, {Yo. V13 oe SUR) ad S "inf { f(z) 2LE [yj—15 Ys} (yy = 4) 


j=l 


> dint{f(2) (oe lena, Cal Gy Swe) 


= inf{ f(x): a € [xp_ 1, €al} Soy Yj — Y5-1) 
j=l 


= inf{ f(x): @ € [a,_1, 24] }(2_ — Ze_-1), 
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and hence that 


n 


L(f, R) = BCP AC | te a5 6a) 


k 
> S- inf{ f(x) DS apa te} (Oe Des) 
k=1 


= L(f, P). 


3 || 


The proof for upper sums is similar. 


Example 7.1.9. Let f(x) = : a aa Then for any partition P = 


{to,---,%n} of [—2,4], for all 4, nf{f(z) : « € [xx_1,2,%]} = 0 and sup{f(z) : x 
te ayep|} = 1, se: that Lf, P):=-0 and UG.) = a 1 * eee eee) = oh = = 
4 — (—2) = 6. Thus in this case, changing the partition does not produce better 


Mm 


approximations. 


Theorem 7.1.10. For any partitions P and Q of [a,b], and for any bounded function 
f: [a,b] > R, 


L(f, P) < U(f,Q). 


Proof. Let R= PUQ. (Then R is a refinement of both P and Q.) By Theorem 7.1.8, 
L(f,P) < L(f, R) and U(f, P) > U(f, R). Since always L(f, R) < U(f, R), the conclusion 
follows by transitivity of <. 


It follows that if f is bounded on |a, bj, then the set of all lower sums of f as P varies 
over all the partitions of [a,b] is bounded above, so that the set of all lower sums has a real 
least upper bound. Similarly, the set of all upper sums is bounded below and has a real 
greatest lower bound. 


Definition 7.1.11. The lower integral of f over [a,b] is 
L(f) =sup{L(f, P) : as P varies over partitions of |a, b]}, 
and the upper integral of f over |a, b] is 
U(f) =inf{U(f,P): P varies over partitions of [a, b]}. 


We say that f is integrable over [a,b] when L(f) = U(f). We call this common 
value the integral of f over [a,b], and we write it as 


[s-[ sous frou 
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Theorem 7.1.10 shows that always L(f) < U(f). Example 7.1.9 shows that some- 
times strict inequality holds. Note that we have not yet proved that the function in Ex- 
ample 7.1.7 is integrable, but in Section 7.3 we prove more generally that every continuous 
function is integrable over a closed bounded interval. 

By Exercise 7.1.3, for non-constant functions the upper and lower sums with respect 
to a partition are never equal. In this case for every partition P, L(f,P) < U(f,P), yet 
for good, i.e., integrable, functions it can happen that L(f) = U(f). 


The simplest examples of integrable functions are constant functions: 


Theorem 7.1.12. (Constant rule for integrals.) If f(x) = c for all x € |a,b], then for 
any partition P of [a,b], [? f = L(f) =U(f) =c(b—a). 


Proof. We apply Theorem 7.1.6 and the definitions. 


Theorem 7.1.13. Let f : [a,b] — R be bounded. Then f is integrable over |a, b| if and 
only if for all € > 0 there exists a partition P- of |a,b] such that U(f, P.) — L(f, Pe) <. 


Proof. Suppose that f is integrable over [a,b]. Then L(f) = U(f). Let « > 0. Since L(f) 
is the supremum of all lower sums L(f, P) as P varies over partitions of [a,b], there exists 
a partition P; of [a,b] such that L(f) — L(f, P.) < €/2. Similarly, there exists a partition 
Py» of [a,b] such that U(f, P2) — U(f) < €/2. Let P = P, UP). Then P is a partition of 
[a, b], and by Equation (7.1.5) and by Theorem 7.1.8, 
Thus since L(f) = U(f), we have that 

= U(f, Po) —U(f) + Lf) — LF, Pr) 

<¢€/2+6/2=€. 

Now suppose that for every « > 0 there exists a partition P. of [a,b] such that 

U(f, P.)-L(f, P-) < «. By the supremum/infimum definitions of lower and upper integrals, 


OS GL) SOC Fe) LG Porc’ 


Since the non-negative constant U(f)—L(f) is strictly smaller than every positive number e, 
it means, by Theorem 2.10.4, that U(f) — L(f) = 0, so that f is integrable. 
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Example 7.1.14. We prove that the following non-continuous function f is integrable 
over [0,1] and that fo f=0: 


fais Te are 4 for some n € NT; 
0, otherwise. 
It is straightforward to verify that this function is not continuous at 0 and at every x of 
the form 1/n for a positive integer n, and that it is continuous everywhere else. For any 
partition P, L(f,P) = 0. So by Theorem 7.1.13 it suffices to prove that for every « > 0 
there exists a partition P of [0,1] such that U(f,P) < «. So let « > 0. We partition 
[(0, 1] so that some sub-intervals are very small and the other sub-intervals have very small 
(zero) supremum of f. By Theorem 3.8.3, there exists a positive integer p such that 
1 < pe. Thus 5 < «. pet a = 
eS see), Seti gt = = ee d and xo, = =o ad d. Clearly rop_1 < Xox < 1, 
L2p = std < l—d = Zap41 < Lap = 1, and Lax < LoK41 because 
1 geen omer eee 
(p—1)p p(p+2) 
[0,1]. For k = 1,...,p, the sub-interval [rex~1, rox] = | 


1 
2p(p+2)° Let to = beaae = 1-—d, Lap42 = 1, and for 


yok DoEFI =. 
= 2d. Thus P = {99;21;---;%2p44} is a partition of 
== d, esl + d] is centered 
Thus these p sub-intervals contain the p numbers 1/2,1/3,...,1/(p +1). The 

ptl pt+l — 2p(p+1 ptl 


(because 55 < aan) and the subinterval [x2p,%2p+1] contains 1/1. Consequently, the 


p—1sub-intervals [x2x, Lon41] = =o +d, ul d| fork = 1,...,p contain no elements 


(p—k+2)(p—k+1) 


1 
on pok+2° 
interval [29,21] = [0 

1 


] contains all 4 for n > p+2 


of the form 1/n for a positive integer n. Thus 


1 eS 4 
U(f,P)= . + ———{ + (Xox — Lop 
(FP) ey) (x1 — Xo) aes: (X2~ — L2~-1) 
p-l 
oF ye 0+ (%on41 — Laz) + (Lap41 — Lap) 
k=1 
ptl 
a4 S28) 4 
k=1 
= (p + 2)(2d) 
_i 
Pp 
<.€. 


Notation 7.1.15. In the definition of integral there is no need to write “dx” when we 
are simply integrating a function f, as in “ fe f”: we seek the signed area determined by 
f over the domain from a to 6. For this it does not matter if we like to plug x or t or 
anything else into f. But when we write “f(x)” rather than “f”, then we need to write 
“dx”, and the reason is that f(a) is an element of the codomain and is not a function. 
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Why do we have to be pedantic? If x and t are non-dependent variables, by the constant 
rule (Theorem 7.1.12) we then have 


b 
/ f(a) dt = f(x)(b— a), 


and specifically, by geometric reasoning, 


4 4 
| xdx = 8, i; xdt = 4x. 
0 0 


Thus writing “dx” versus “dt” is important, and omitting it can lead to confusion: is 
the answer the constant 8, or is it 4% depending on x? Furthermore, if 2 and ¢ are not 
independent, we can get further values too. Say if « = 3¢t, then 


4 4 
il cat= f 3t dt = 24. 
0 0 


In short, we need to use notation precisely. 

The following theorem says that to compute (lower, upper) integrals of integrable 
functions, we need not use all the possible partitions. It suffices to use, for example, only 
partitions into equal-length sub-intervals. The second part of the theorem below gives a 
formulation of integrals that looks very technical but is fundamental for applications (see 
Section 7.7). 


Theorem 7.1.16. Let f : [a,b] + R be bounded and integrable. For each real number 
r > Olet P. = {2 ol”, 2.0} be a partition of [a,b] such that each sub-interval 


[o”,, a”) has length at most r. Then 


. . 
lim De) ay UG Py) -{ ie. 


rot 


Furthermore, if for each r > 0 and each k = 1,2,...,n,, of) is arbitrary in 


[o”,, a”), then 


rot 


Nr b 
lim ye F(a” - al") ,) = / re 
k=1 a 


Proof. Let « > 0. By Theorem 7.1.13, there exists a partition P of [a,b] such that 
U(f,P) —L(f,P) < ¢/2. Let P = {yo,y1,.--,Yn}. Since f is bounded, there exists a 
positive real number M such that for all x € [a, bj, |f(a)| < M. Let r be any positive real 
number such that 


We 
r<=min LY1 — YoYo — Yise+-Yn — na} 


€ 
2 aa 
and let P, = {x0,%1,..-,%m,} (omitting the superscript (r)). By the definition of r, 
each sub-interval |r,_1,2,] contains at most one element of P. When it does contain an 
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element of P, we call such k special and we denote its point of P as y;(x). Thus [7,~1, 7%] = 
[ve-1, ¥5(k)] U [Yj(n), Te]. The elements yo and y, are each in exactly one sub-interval of 
P,, whereas the n— 1 points y1,...,Yn—1 may each be contained in two sub-intervals of P, 
(when it is also in P,), so that there are at most 2+2-(n—1) = 2n special integers. For 
non-special k there exists a unique i(k) € {1,...,n} such that [x,-1, 2%] C [Yacn)—15 Yacr)]- 
Then 


Mr 


L(f, Pr) = >> inf{ f(x) : © € [xe_1, 28] }(@e — te-1) 
k=1 
= So inf f(e) sa € fea, ee] (ve — 2x1) 
special k=1 


Mr 


-- > inf{ f(a): a © [ap_1, 4] }(LR — Le-1) 


non-special k=1 


The last row is greater than or equal to 


Mr 


iS inf{ f(x) : @ € [yicny—1, Yaa] }(@e — Te-1); 
non-special k=1 
which is not quite equal to L(f,P) because for each i € {1,...,n}, the union of the sub- 
intervals [x,_1, 7%] contained in [y;_1, y;] need not be all of [y;_1, y;]. In fact, L(f, P) equals 
the summand in the last row plus 


Mr 


>> inf f(w) = @ € [yay—1, ¥3@)] }(Yi@e) — BH-1) 
special k=1 
a y inf{ f(z) : © € [yycny ¥ce) 41] }(@R — Y5(K))- 
special k=1 
Thus 
EGP) > ), Amis @) ie [eeastel}@e= 1) + EG, P) 
special k=1 
— SP inf f(a): & € [yjcmy—1, 9c) ]} (Y5¢m) — @e-1) 
special k=1 
= Sy inf{ f(x) : © & [y5¢n), Yy(e)-ral }(@e — Yy(Ry)- 
special k=1 


>- S$) M(ap -2y-1) + L(f,P) 


special k=1 


Section 7.1: Approximating areas 247 


— So) Miyjay-tea)- SY) M(ae-y50%)) 
special k=1 special k=1 
=L(f,P)-— >> 2M(xx_ - 2-1) 
special k=1 
> L(f, P) —2M2nr 
> L(f,P)— 5. 


Since U(f,P) > U(f) = L(f) = L(f,P) and U(f,P) — L(f,P) < §, it follows that 
L(f) — L(f, P) < «/2. Thus by the triangle inequality and minding which quantities are 


larger to be able to omit absolute value signs, 
€ 
Lf) - Lf, Pr) < Lf) - USP) +5 <6 
This then proves that lim,_,9+ L(f,P,) = L(f) = i f. Similarly we can prove that 
U(f, P,) — U(f) < € and so that lim,_,9+ U(f, P-) = U(f) = J? f. 


Furthermore, by compatibility of order with multiplication by positive numbers and 
addition, we have that for each r, 


UG PIS IG VG. 2, SUP: 
k=1 


So that for all r sufficiently close to 0, 377", (ch?) (a — a”),) is within € of hs f. Thus 
by the definition of limits, lim,.9+ 07", F(a? - al”),) = ie Fe 


Exercises for Section 7.1 
7.1.1. Prove that if P = {xo,...,@,} is a partition of [a,b] into n equal parts, then 
C,r=at k2=2, 
1, ifa<2; 
0; ia > 2s 
and Q the partition of [0,3] into three equal intervals. 

i) Compute L(f, P) and U(f, P). 

ii) Compute L(f,Q) and U(f,Q). 

iii) Compare L(f,P) and L(f,Q). Why does this not contradict Theorem 7.1.8? 


7.1.3. Let P be a partition of [a,b], and suppose that L(f,P) = U(f,P). Prove that f 
must be constant on [a, )]. 


7.1.2. Let f(x) = Let P be the partition of [0,3] into two equal intervals 
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7.1.4. Use geometry to compute the following integrals: 


: 5, if x < —7; 
: 2, tp <e< | 
iv) ff, where f(z) = Se i Lew < 3: 
9-2, if3<z. 
7.1.5. Use geometry to compute the following integrals (t and x do not depend on each 


other). Justify all work. 


i) [ ve=wat- 
5 
i) [ (ae — 10) at = 


‘ 5, if t < —7; 
ee 2, if -7<t<1: 
iii) i: where f(t) = 3t, ifl<t<3: 
9-2, if3<t. 


7.1.6. Let P,, be a partition of [—1,1] into n equal parts. Let f(z) = 1 if « = 0 and 
f(x) = 0 otherwise. 

i) Graph f and conclude that f is not continuous at 0. 

ii) Compute L(f, P,) and U(f, P,). 
7.1.7. EasyLanders define Eantegrals as follows: they divide the interval [a, b] into n equal 
sub-intervals, so each has length baa By Exercise 7.1.1, the kth sub-interval given in this 
way is [a + (k — 1) baa atk pea) Rather than finding the minimum and maximum of 
f on this sub-interval, they simply Eapproximate f on the sub-interval by plugging in the 
right endpoint a+k baa so that the signed area over the kth sub-interval is Eapproximately 
iF (a + k,2=2) baa Thus the Eapproximate signed area of f over [a,b] via this partition is 
ae (a + ke) baa If this sum has a limit as n goes to infinity, then EasyLanders 
declare the Eantegral of f over [a,b] to be 


b ; is b—a\ b-a 
ifm dads (ork n ) n . 


b 
Suppose that f : [a,b] > R is integrable. Prove that the Eantegral f f exists and equals 


the integral i rie 
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1, if x is rational; 


7.1.8. Use Eantegrability from the previous exercise. Let f(x) = { i ahec ei eatoaal 


By Example 7.1.9 we know that f is not integrable over [0, 2]. 
2 
i) Prove that f is Eantegrable over [0,2] and find its Eantegral f if 
0 


ii) Compute (7 (a and prove that hs + (a eo fr 


7.2 Computing integrals from upper and lower sums 


The definition of integrals appears daunting: we seem to need to compute all the 
possible lower sums to get at the lower integral, all the possible upper sums to get at the 
upper integral, and only if the lower and upper integrals are the same do we have the precise 
integral. In Example 7.1.7 in the previous section we have already seen that numerically 
we can often compute the integral to within desired precision by taking finer and finer 
partitions. In this section we compute some precise numerical values of integrals, and 
without computing all the possible upper and lower sums. Admittedly, the computations 
are time-consuming, but the reader is encouraged to read through them to get an idea of 
what calculations are needed to follow the definition of integrals. In Section 7.4 we will see 
very efficient shortcuts for computing integrals, but only for easy/good functions. 


Example 7.2.1. Let f(x) = x on [2,6]. We know that the area under the curve between 
6 


x =2 and x = 6 is 16. Here we compute that indeed f = 16. For any positive integer n 


2 
let P, = {20,.--,%n} be the partition of [2,6] into n equal parts. By Exercise 7.1.1, 
tp=2+ kA, Since f is increasing, on each sub-interval [a,~1, 7%] the minimum is x,_1 
and the maximum is x;. Thus 


U(f, P,) = > tei — Dj) 


| 
aN 
~ 
ae 
a 
Si xa 
NY 
i 


k=1 
n 4 n 4 2 
=Sr244 (=) 
n n 
k=1 k=1 
4 oar ee sn 
=2—-n+ es ( ) 
n 2 n 
8 1 
Bere ue 
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It follows that for all n, 


U(f) =inf{U(f, P) : P varies over partitions of [2, 6]} 


k=1 
=> (2+-0;) “ 
=e (4) 
=e mee un (4) 
a4 ga 


and 
L(f) =sup{L(f,P): P varies over partitions of [2, 6] } 
> sup{L(f, Pa): 2 € NT} 
= 16. 
All together this says that 
16 < L(f) < U(f) < 16, 


so that L(f) = U(f) = 16, and finally that ie { = 16: 
Note that we did not compute all the possible lower and upper sums, but we com- 


puted enough of them. We knew that we computed enough of them because as n goes 
large, sup{L(f, Pn) : n} = inf{U(f, Pr) : n}. 


Example 7.2.2. We compute the integral for f(a) = x? over [1,7]. For any positive inte- 
ger n let P, = {x0,...,%n} be the partition of [1,7] into n equal parts. By Exercise 7.1.1, 
Lp=1t ke, Since f is increasing on [1,7], on each sub-interval [v,_1,2,] the minimum 
is z?_, and the maximum is xz. Thus, by using Exercise 1.6.1 in one of the steps below: 


Section 7.2: Computing integrals from upper and lower sums 251 


k=1 

“ 12 36 \ 6 
=yo (148 +k? >| 

nm 

k=1 

6 196. oe Sey 
“Soe yee stew (S) 

k=1 

6  n(nt1) 126 n(n+1)(2n4+1) (6\?° 
n 2 nn 6 n 

36(n+1)  36(n+1)(2n+1 

=e is an me Ment ) 


so that U(f) < inf{U(f, Pn): n} =6+ 36+ 72 = 114. Similarly, 
L(f) = L(f, Pr) = 114, 
whence 114 < L(f) <U(f) < 114 and fi f = 114. 


Example 7.2.3. The goal of this exercise is to compute i x dx. For the first attempt, 
let P, = {Xo,---,2%n} be the partition of [0,2] into n equal intervals. By Exercise 7.1.1, 
i 2h The square root function is increasing, so that 


Py) = d. ViK(Lk — Le-1) 


But now we are stuck: we have no simplification of )77_, Vk, and we have no other 
immediate tricks to compute inf{(2)°/? Sawer 
But it is possible to compute enough upper and lower sums for this function to get 


the integral. Namely, for each positive integer n let Q, = {0, 2 aa) 25, 2), ee. ze ze = 
2}. This is a partition of [0,2] into n (unequal) parts, with x, = ze . Since \/ is an 


increasing function, on each sub-interval [x,-1, 7%] the minimum is achieved at x,—; and 
the maximum at xx, so that 


U(f. Qn) 2S Ge eé hoi) 


n2 n2 
k=1 


= RH (b-1)) 
k=1 
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= = 
k=1 


J2 n 
= OSH ee ot 
=1 


k=1 
= sane Hien a) as Z (by Exercise 1.6.1) 
= 2¥? (n + 1)(2(2n + 1) — 3) (by factoring) 
= =“ (n+ 1)(4n — 1). 


Thus U(f) < inf{U(f, P,) : n} = inf{34(n41)(4n—1) : n} = 42. Similarly, L(f) > 
ay2 so that. bi Je dx = 4/2 


Exercises for Section 7.2 
7.2.1. Mimic examples in this section to compute fe x? dx. (You may need Exercise 1.6.2.) 


7.2.2. Mimic examples in this section to compute i a 


7.3. What functions are integrable? 


Theorem 7.3.1. Every continuous real-valued function on [a,b] is integrable over [a,b], 
where a,b € R witha < b. 


Proof. Let f : [a,b] — R be continuous. We need to prove that L(f) = U(f). By 
Theorem 2.10.4 it suffices to prove that for all « > 0, U(f) — L(f) <«. 

So let « > 0. By Theorem 5.5.4, f is uniformly continuous, so there exists 6 > 0 such 
that for all x, c € |a, 6], if |a—c| < 6 then |f(x)— f(c)| < e/(b—a). Let P = {x, 21,...,2n} 
be a partition of [a,b] such that for all k =1,...,n, x, —x%p-1 < 6. (For example, this can 
be accomplished as follows: by Theorem 3.8.3, there exists n € NT such that (b—a) < nd, 
and then P can be taken to be the partition of [a,b] into n equal parts.) Then 


UP) dif, P) 
= )_ (supt{f(#) + @ € [ex-1, ve] } 


—inf{ f(x): a € [xx~_1, ex] }) (Ge — LeR_1) 


3 


y (we = 54) 


Section 7.3: What functions are integrable? 253 


(by uniform continuity since x, — ©R_1 < 0) 


n 


— C < a) Sie = Lk-1) 


k=1 


But U(f) < U(f, P) and L(f) = L(f, P), so that 0 < U(f)—-L(f) < U(f, P)-L(f, P) <«. 
Thus U(f) = L(f) by Theorem 2.10.4, so that f is integrable over [a, b]. 


Both the proof above as well as Theorem 7.1.16 prove that for a continuous function f 


on [a, b}, 

b Rr 

(r)\7..(7) (r) 
=e - 
: J ate so Flee (x Ty); 
k=1 

where for each r > 0, {a, al”, kgs a} is a partition of [a,b] into sub-intervals of length 
at most r, and for each k = 1,...,n,, Ce © [e_1, Lx]. 


Theorem 7.3.2. Let f : [a,b] > R be bounded. Let S = {80,81,...,5m} be a finite 
subset of [a,b] with so =a < 81 <-++ < 8m =. Suppose that for all d > 0 and allk f 
is integrable over [s,—1 + d,s, — d]. (By the previous theorem, Theorem 7.3.1, any f that 
is continuous at all x € [a,b] \ S satisfies this hypothesis.) Then f is integrable over [a,b], 


> Sk 


k=1% Sk-1 


and 


Proof. Let « > 0. Since f is bounded, there exists a positive real number M such that 
for all x € [a, 0], |f(z)| < M. Let e = } min{s1 — 59,82 — 81,...,8m — 8m—1} and d = 
min{e, €/(4M(m+1)(2m+1))}. By assumption, for each k = 1,...,m-—1, f is integrable 
on the interval [s,_; + d,s, —d]. Thus by Theorem 7.1.13, there exists a partition P; of 
[sp—1 +d, $, —d] such that U(f, P,) — L(f, Pp) < €/(2m+1). Now let P = {a} UP, UP,U 


---UPm-1U {b}. Then P is a partition of [a,b], and 


UL P)= LEP) 
2360 Paw eaeeayeaG Pom ee) 
apt U(f, PN [a,a+d]) — L(f, PA [a,a+d])) 
+ SUAPN be — dae +d) — LF, PM [oe — dye +) 
+ (UU, Pn b—4,8)) — Lf, Pn [b—4,8)) 


m—-1 
U(f, Px) — L(f, Pr) + 2M -2d+ S° 2M -2d+2M - 2d 
k=1 k=1 


IA 
M: 
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= € 
AM 1 
2 ea (m+ 1)d 


<€. 
Thus by Theorem 7.3.1, f is integrable. Furthermore, ie f=>5.4 ae f is bounded 


above by U(f, P) — )o¢_, L(f, Px) and below by L(f,P) — )°7_, U(f, Px), and the upper 
and lower bounds are within 2e of 0, so that ie ae oe f is within 2e of 0. Since e 


is arbitrary, it follows by Theorem 2.10.4 that ii {= es. fe, 


Notation 7.3.3. What could possibly be the meaning of bee f if a < b? In our definition 
of integrals, all partitions started from a smaller a to a larger b to get if. f. If we did 
reverse b and a, then the widths of the sub-intervals in each partition would be negative 
(Up —Lp—-1 = —(Lp-1 —Lx)), So that all the partial sums and both the lower and the upper 
integrals would get the negative value. Thus it seems reasonable to declare 


aa ks 


In fact, this is exactly what makes Theorem 7.3.2 work without any order assumptions 
on the sz. For example, if a < c < b, by Theorem 7.3.2, {fr = ue + - f, whence 


c b b b c 
Ue NP aad OO eat eae 
Theorem 7.3.4. Suppose that f and g are integrable over |a,b|, and that c € R. Then 
f +g is integrable over [a,b] and og +cg)= abs ft cfg. 


Proof. We first prove that L(cg) = U(cg) = cL(g) = cU(g). If c > 0, then 
L(cg) = sup{L(cg, P) : P a partition of [a, b]} 
= sup{cL(g, P) : P a partition of [a, b]} 
= csup{L(g, P) : P a partition of [a, b]} 
= cL(g) 
= cU(g) 
= cinf{U(g, P) : P a partition of [a, b]} 
= inf{cU(g, P) : P a partition of |a, b]} 
= inf{U(cg, P) : P a partition of |a, b]} 
= U(cg), 
and if c < 0, then 
L(cg) = sup{L(cg, P) : P a partition of [a, b]} 
= sup{cU(g, P) : P a partition of [a, b]} 
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= cinf{U(g, P) : P a partition of [a, b]} 
= cU(g) 
cL(g) 
= cinf{L(g, P): P a partition of [a, b]} 
= sup{cL(g, P) : P a partition of [a, b]} 
inf{U (cg, P) : P a partition of [a, b]} 
= U(cg). 


This proves that cg is integrable with f(g) = cf” g. 


Let « > 0. By integrability of f and cg there exist partitions P,Q of |a,b] such that 
U(f, P) — L(f, P) < €/2 and U(cg,Q) — Li(cg,Q) < ¢/2. Let R= PUQ. Then Risa 
partition of [a,b], and by Theorem 7.1.8, U(f, R)—L(f, R) < €/2 and U(cg, R)—Li(cg, R) < 
€/2. By Exercise 2.7.7, for every partition of [a,b], and in particular for the partition R, 
L(f +cg,R) 2 L(f, R) + L(cg, R), and U(f +cg,R) <U(f, R) + U(cg, R). Then 

0<U(f+cg) — L(f + cq) 
<€. 
Thus U(f + cg) — L(f + cg) = 0 by Theorem 2.10.4, and so f + cg is integrable. The 
inequalities L(f + cg,R) > L(f,R) + L(cg,R), and U(f + cg,R) < U(f,R) + U(cg, R) 
furthermore prove that L(f)+L(cg) < L(f+cg) < U(f+eg) < U(f)+U (cg) = L(f)+L (cg), 


so that finally 
frren= [+ fers frre fis 


Theorem 7.3.5. Let a,b € R witha < b. Let f,g : [a,b] + R be integrable functions 
such that f(x) < g(a) for all x € [a,b]. Then 


[tsfoo 


Here is a picture that illustrates this theorem: the values of g are at each point in 


the domain greater than or equal to the values of f, and the area under the graph of g is 
larger than the area under the graph of f: 
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Proof. By assumption on every sub-interval J of [a, b], inf{ f(a) : « € I} < inf{g(x): a ET}. 
Thus for all partitions P of [a,b], L(f,P) < L(g, P). Hence L(f) < L(g), and since f and 
g are integrable, this says that ii f< ss g. 


Exercises for Section 7.3 

7.3.1. The following is known: [} f =5, f) f =6, f° f = 15, [7g = —3. Compute and 
ele 2 3 1 

justify f (3f — 4g), Je 5f, {5 3g. 


x, if x is rational; 
ee a if x is irrational. 
i) Prove that f is not integrable over [0, 1]. 


ii) Does this contradict Theorem 7.3.2? Justify. 
7.3.3. Let f :[—a,a] > R be an integrable odd function. Prove that ee f=0: 
7.3.4. Let f : [—a,a] > R be an integrable even function. Prove that [", f =2 fo f. 


7.3.5. Let f : [a,b] > R be piecewise continuous as in Theorem 7.3.2. 

i) Prove that |f| is integrable over |a, bj. 

ii) Prove that ed < [ fl. 
7.3.6. Find a function f : [0,1] — R that is not integrable over [0,1] but such that | f| is 
integrable over [0, 1]. 


7.3.7. So far we have seen that every differentiable function is continuous and that every 
continuous function is integrable. 
i) Give an example of a continuous function that is not differentiable. 


ii) Give an example of an integrable function that is not continuous. 


*7.3.8. Let f : [a,b] + R be monotone. Prove that f is integrable over [a, }]. 
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7.4 The Fundamental theorem of calculus 


Despite first appearances, it turns out that integration and differentiation are related. 
For this we have two versions of the Fundamental theorem of calculus. 


Theorem 7.4.1. (The Fundamental theorem of calculus, I) Let f,g : [a,b] > R 
such that f is integrable over [a,b] and g is differentiable with g’ = f. Then 
b 
[ f=90)-9@. 
Proof. Let P = {x0,%1,...,%} be a partition of [a,b]. Since g is differentiable on [a, b], it 


is continuous on each [x,—1,2,| and differentiable on each (xp_1, 2%). Thus by the Mean 


value theorem (Theorem 6.3.4), there exists cy € (@,_1,2~%) such that f(cx.) = g'(ce) = 


g(#~)—g(@Kr-1) 


eae By the definition of lower and upper sums, 


LUGP) Sy F(a) Ge eat) SUP 


k=1 
But 


= =. — g(an-1)) 
hat 
= 9(n) — g(@o) 
= 9(0) — g(a), 
so that L(f,P) < g(b) — g(a) < U(f, P), whence 
L(f) =sup{L(f, P) : P a partition of |a, b]} 
< g(b) — ga) 
< inf{U(f, P) : P a partition of [a, b]} 
= U(f). 
Since f is integrable over [a,b], by definition L(f) = U(f), and so all inequalities above 


have to be equalities, so that necessarily i f =q(b) — g(a). 


The general notation for applying Theorem 7.4.1 is as follows: if g’ = f, then 


[1-90] 


= g(b) — g(a). 
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1422-2? 
(1+2?)2 


—— peli! sa ese 
5 ——— — — 
9 (1427)? eee hoe de> <p? 


For example, since is the derivative of ace it follows that 


If we instead had to compute this integral with upper and lower sums, it would take us a 
lot longer and a lot more effort to come up with the answer. 

In general, upper and lower sums and integrals are time-consuming and we want to 
avoid them if possible. The fundamental theorem of calculus that we just proved enables 
us to do that for many functions: to integrate f over [a,b] one needs to find g with g’ = f. 
Such g is called an antiderivative of f. For example, if r is a rational number different 

a”. By the 
scalar rule for derivatives, for any constant C, uv + C is also an antiderivative. It does 


from —1, then by the power rule (Theorem 6.2.10), an antiderivative of x” is 


not matter which antiderivative we choose to compute the integral: 


b 1 r+1 r+1 r+1 
prt + prt + 
[ va- +C)-— a +C)= agen 
7 r+1 r+1 Pepe, wear 


so that the choice of the antiderivative is irrelevant. 


Definition 7.4.2. (Indefinite integral) If g is an antiderivative of f, we write also 


/ f(a) de = g(a) +0, 


where C' stands for an arbitrary constant. 


For example, f 32? dx = 23+C, f3dx =32+C, ftdxr =tx+C, fxdu = }2?+C, 
and so on. (Study the differences and similarities of the last three.) 

So far we have seen 2” for rational exponents x. Exercise 5.4.7 also allows real 
exponents, and proves that this function of x is continuous. Thus by Theorem 7.3.1 this 
function is integrable. We do not yet know f{ 2% dx, but in Theorem 7.6.5 we will see that 
f2%dx = 4,27+C. For f 9(@*) dx instead, you and I do not know an antiderivative, 
we will not know one by the end of the course, and there actually is no “closed-form” 
antiderivative. This fact is due to a theory of Joseph Liouville (1809-1882). What is the 
meaning of “closed-form”? Here is an oblique answer: Exercise 10.1.5 claims that there 
exists an infinite power series (sum of infinitely many terms) that is an antiderivative 
of 2(2”), Precisely because of this infinite sum nature, the values of any antiderivative 
of 2(**) cannot be computed precisely, only approximately. Furthermore, according to 
Liouville’s theory, that infinite sum cannot be expressed in terms of the more familiar 
standard functions, and neither can any other expression for an antiderivative. It is in this 
sense that we say that 2(*) does not have a “closed-form” antiderivative. 


(It is a fact that in the ocean of all functions, those for which there is a “closed-form” 
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antiderivative form only a tiny droplet.) 

At this point we know very few methods for computing antiderivatives. We will in 
time build a bigger stash of functions: see the next section (Section 7.6) and the chapter 
on power series (Chapter 9). 

The simplest method for finding more antiderivatives is to first find a differentiable 
function and compute its derivative, and voila, the original function is an antiderivative of 
its derivative. For example, by the chain and power rules, (x? + 32)!°° is an antiderivative 
of 100(a? + 3x)99(2z 4+ 3). 


Theorem 7.4.3. (The Fundamental theorem of calculus, II) Let f : [a,b] > R be 
continuous. Then for all x € [a,b], f is integrable over [a,x], and the function g : [a,b] ~ R 


given by g(x) = Jf. f is differentiable on (a,b) with 


=f f= 10. 


Proof. Since f is continuous over [a, b], it is continuous over [a, x], so that by Theorem 7.3.1, 
f is integrable over [a,x]. Thus g is a well-defined function. Let c € (a,b). We will prove 
that g is differentiable at c. 

Let « > 0. By continuity of f at c, there exists 6 > 0 such that for all x € [a, }], if 
|2 —c| < 6 then |f(x) — f(c)| < «. Thus on [c—6,c+ 6] N [a,b], fic) —e < f(x) < flo) +e, 
so that by Theorem 7.3.5, 


[or Go-os [rr rs [ge 


min{z,c} min{z,c} min{z,c} 


But by integrals of constant functions, 


max{z,c} 
/ (f(c) + €) = (max{a,c} — min{z, c})(f(c) + €) = |a — e(f(e) £6). 


min{z,c} 


Thus 


max{z,c} 


eee < | f<le-d(fte). 


min{z,c} 
If « £ c, dividing by |” — c| and rewriting the middle term says that 


Hej=ce 2 yGias 


L—C 


LF 2 46) 


whence —e < Jet _ F(¢) <e, and 


<e. Then for all x € [a,b], if 0 < |x—c| < 4, 
then 
thos ae 


LC 


g(x) — g(c) #00) = 


wL—C 


ro 
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let 4 


(by Theorem 7.3.2 and Notation 7.3.3) 


<e€. 


Thus lim g(x) = (0) exists and equals f(c), i.e., g’(c) = f(c). 


rc v~—C 


It is probably a good idea to review the notation again. The integral le. f can also 


[=f r04e=f rox 


This is a function of x because x appears in the bound of the domain of integration. Note 
similarly that je f(t) dz are functions of t and x but not of z. Thus by the Fundamental 


be written as 


theorem of calculus, IT, 
d [* d [* 
af f= $@), nd Ff saz = sf, 


Do not write “/* f(x)dzx”: this is trying to say that x varies from a to x, so 
one occurrence of the letter x is constant and the other occurrence varies from a to that 
constant, which mixes up the symbols too much. 


Corollary 7.4.4. (The Fundamental theorem of calculus, II) Let f : [a,b] > R be 
continuous and let k,h be differentiable functions with codomain in [a,b]. Then 


d h(a) 


re f = f(h(a))h'(x) — f(R(a))k' (2). 
© Sk) 


Proof. Let g(x) = i f. By Theorem 7.4.3, g is differentiable and g' = f. Then 
de®) a h(x) k(z) 
ee ( for-fos 
XL Sk (x) dx a a 


= — (g'(h(x))h'(x) — g(k(x))k’(x)) (by the chain rule) 
= f(h(x))h'(x) — f(k(x))k'(2). 


Exercises for Section 7.4 
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7.4.1. Compute the integrals below. You may want to use clever guessing and rewriting. 
1 
i) | Vide = 
0 
1 
ii) i. 16x(a? + 4)’ dr = 
0 


1 
ii) [Bye +4)(0? +2043)" de = 
0 


7.4.2. Compute the integrals below, assuming that t and x do not depend on each other. 


1 
» | a? dz = 
0 
1 
ii) i d= 
He 
iii) i x? dt = 
0 


7.4.3. Below t and x do not depend on each other. Compute the following derivatives, 


possibly using Theorem 7.4.3. 


7.4.4. Suppose that f : [a,b] + R* is continuous and that f(c) > 0 for some ¢ € [a, }]. 
Prove that Nhs 70: 
7.4.5. (Integration by substitution) By the chain rule for differentiation, (fo g)'(x) = 
F'(g(a))9'(@). 

) Prove that J?’ f’(g(«))g/(«) dx = f(g(b)) — F(g(a)). 
ii) Prove that [ f’(g(x))g’'(x) dx = f(g(x)) + C. 

) Compute the following integrals applying this rule explicitly stating f, g: 


3 
[ e2-4)ae = 
2 
+ Ag +3 
gdxz = 
. V 2x? + 3x 
[ Ge +6) 20? + 30 dx = 


1 
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7.4.6. (Integration by parts) By the product rule for differentiation, (f - g)'(x) = 


f'(x)g(@) + F(a)g' (a). 


(x 
i) Prove that f” f"(x)g(«) de = one = = en (x 
ii) Prove that [ f’(x)g(x) dx = Be 
iii) Conus the aie ee ae this = hon stating f, g: 
ie (42 + 3)(5¢ +1) dx = 
pt 4x +3 


——_—_ dr = 
1 V2x+4 


7.4.7. Compute the following derivatives. (Hint: the Fundamental theorem of calculus 
and the chain rule.) 


5 \V/t4 + Sv dt. 


ape t+5Vt 


ii) Ge | 2 M00 — 2680407 — 2 


4 —h(ax) 
ili) — t) dt. 
Def, 10 


7.4.8. (Mean value theorem for integrals) Let f : [a,b] — R be continuous. Prove 
that there exists c € (a,b) such that 
l b 
=. i; f 


7.4.9. Let f have derivatives of order up to n + 1 on the interval [a, }]. 


+f f'(t) dt 


ii) Integrate the integral above by parts, and rewrite, to get that 


dt. 


i) Justify how for any zx in |{a, d}, 


x 


f(x) = f(a) + (@— a) f"(a) + / (a —t) f(t) dt. 


a 


iii) Use induction, integration by parts, and rewritings, to get that f(x) equals 


MG (n) ee 
FC cl) eee reer u e- ans fo AO" Ho ae 


n! 


iv) Say why you cannot apply the Fundamental theorem of calculus II, to compute 
el, DT PO eat 


v) (Taylor’ remainder formula in integral form) Consult Section 6.5 for Taylor poly- 
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nomials to prove that 


fa) = Tahal @) +f 


a 


ERE pon esa 


7.4.10. (Improper integral, unbounded domain) Let f : [a,oo) — R be continuous. 
i) Discuss how our construction/definition of integrals fails when the domain is not 
bounded. 
ii) Prove that for all N € [a, oo), iy f exists. 
iii) If limy—+oo (el f exists, we call the limit the (improper) integral of f over [a, co). 
We denote it i f. Observe that this is a limit of limits. Similarly formulate the 


definition of flues g for a continuous function g : (—co,b] > R. 
avieesea: ‘ad 14 ita 10: 


0, otherwise. 
7.4.11. Use Exercise 7.4.10. 
i) Compute [7° 4dz. 


ii) For any rational number p < —1, compute {o> x? dx. (The same is true for real 


. Compute and justify ee ie 


p< —1, but we have not developed enough properties for such functions.) 


7.4.12. (Improper integral, unbounded domain) Let f : R — R be bounded such 
that for some c € R, [°.. f and f~ f exist in the sense of Exercise 7.4.10. 
i) Prove that for alle € R, f°. f and f- f exist and that 


eg ed 
We denote this common value as Yas, Pe 
ii) Prove that jee f= lms lia he 


7.4.13. Let f : R — R over R. This exercise is meant to show that integrability of f 
over R cannot be simply defined as the existence of the limit limy_—.. est f. Namely, let 


eS { Ly i ae] (the floor of x) is even; 
—1, otherwise. 
i) Sketch the graph of this function. 
ii) Prove that for every positive real number JN, i f=. 
iii) Prove that f is not integrable over |[0,00) or over (—oo,0) in the sense of Exer- 
cise 7.4.10. 
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7.4.14. (Improper integral, unbounded function) Let f : (a,b] > R be continuous. 
i) Discuss how our construction/definition of integrals fails when the domain does not 
include the boundaries of the domain. 
ii) Prove that for all N € (a,b), ie f exists. 
iii) If limy_,,+ fr f exists, we call this limit the (improper) integral of f over [a, }J, 
and we denote it Hie f. Similarly formulate fe f if the domain of f is [a,b) or (a,b). 
7.4.15. Use Exercise 7.4.14. 
i) Compute ie gad 
ii) For any rational number p > —1, compute ie x? dx. (The same is true for real 
p > —1, but we have not developed enough properties for such functions.) 
iii) Let f : (a,b) > R be given by f(x) = 1. Prove that [fr =b—a. 


7.5 Integration of complex-valued functions 


So far we have ree neta of oe. functions. By Theorem 7.4.1, if g = f, 

then Se = g(b) — g(a), ie., fe g’ = g(b) — g(a). If g is complex-valued, we know that 

= (Reg) +i(Im iy , so that it would ie sense to define is g’ as the integral of (Re g)’ 
plus 7 times the integral of (Im g)’. Indeed, this is the definition: 


Definition 7.5.1. Let f : [a,b] — C be a function such that Re f and Im f are integrable 
over |a,b]. The integral of f over [a, b] is 


frm foversifims 


The following are then immediate generalizations of the two versions of the funda- 
mental theorem of calculus Theorems 7.4.1 and 7.4.3: 


Theorem 7.5.2. (The Fundamental theorem of calculus, I, for complex-valued 
functions) Let f,g : [a,b] > C such that f is continuous and g is differentiable with 


g' = f. Then 
b 
[ f=90)-9(@. 
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Theorem 7.5.3. (The Fundamental theorem of calculus, II, for complex-valued 


functions) Let f : [a,b] + C be continuous. Then for all x € |a,b], f is integrable over 


[a,x], and the function g : [a,b] + R given by g(x) = J. f is differentiable on (a,x) with 


=f t=#@. 


Exercises for Section 7.5 


7.5.1. Work out Exercises 7.3.3 and 7.3.4 for complex-valued functions. 


7.5.2. This is a generalization of Exercise 7.3.5. Let f : [a,b] > C be integrable. Prove 


that | i 


f| = ‘fs |f|. (Hint: Write ie =s-+ti for some s,t € R. Integrate (s — ti) f.) 


7.6 Natural logarithm and the exponential functions 


The function that takes a non-zero x to 1/x is continuous everywhere on its domain 


since it is a rational function. Thus by Theorem 7.3.1 and Notation 7.3.3, for all x > 0, 


te +dax is well-defined. This function has a familiar name: 


Definition 7.6.1. The natural logarithm is the function 


al 
Inz = —dt 
1 t 


for all x > 0. 


We prove below all the familiar properties of this familiar function. 


Remark 7.6.2. 


Inl= f, 4dt=0. 

By geometry, for « > 1, na = f° $dt > 0, and for x € (0,1), na = J," }dt = 
=f ide 0: 

By the Fundamental theorem of calculus (Theorem 7.4.3), for all b € R™, In is 
differentiable on (0, b), so that In is differentiable on R*. Furthermore, In’(z) = +. 
In is continuous (since it is differentiable) on R*. 

The derivative of In is always positive. Thus by Theorem 6.3.5, In is everywhere 


increasing. 


Let c € R*, and set g(x) = In(cx). By the chain rule, g is differentiable, and 


g(x) = +e = 4 =In'(z). Thus the function g — In has constant derivative 0. It 
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(11) 


(12) 


(13) 
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follows by Theorem 6.3.5 that g — In is a constant function. Hence for all « € R™, 
In(cx) — In(x) = g(x) — In(x) = g(1) — n(1) = In(c) — 0 = In(c). 
This proves that for all c,z € R‘, 


In(cx) = In(c) + In(z). 


By the previous part, for all c,z € RT, 
Cc 
l (<) = In(c) — In(z). 
n{- n(c) — In(z) 


For all non-negative integers n and all c € R™, In(c”) = nIn(c). We prove this 
by mathematical induction. If n = 0, then In(c”) = In(1) = 0 = Olnc = nInc. 
Now suppose that equality holds for some n — 1. Then In(c”) = In(c”~!c) = 
In(c”~1) + In(c) by what we have already established, so that by the induction 
assumption In(c”) = (n — 1) In(c) + In(c) = nIn(c). 


For all rational numbers r and all c € R™, In(c”) = rIn(c). Here is a proof. We 
have proved this result if r is a non-negative integer. If r is a negative integer, 
then —r is a positive integer, so that by the previous case, In(c”) = In(1/c"") = 
In(1) — In(c~") = 0 — (-r) In(c) = rIn(c), which proves the claim for all integers. 
Now write r = ™ for some integers m,n with n 4 0. Then nIn(c") = n In(e?) = 
In(c™) = mIn(c), so that In(c") = “ In(c) = rIn(c). 


The range of In is R = (—oo, 00). Here isa proof. By geometry, In(0.5) < 0 < In(2). 
Let y € R*. By Theorem 3.8.3, there exists n € N* such that y < nln(2). Hence 
nl =0<y< nln(2) = In(2”), so that since In is continuous, by the Intermediate 
value theorem (Theorem 5.3.1), there exists x € (1,2”) such that In(x) = y. If 
y € R’, then by the just proved we have that —y = In(x) for some x € R™, so 


that y = —In(x) = In(a~"). Finally, 0 = In(1). Thus every real number is in the 


range of In. 


Thus In : R* > R is a strictly increasing continuous and surjective function. Thus 


by Theorem 2.9.4, In has an inverse In + RS RE, By Theorem 5.3.4, In? is 


increasing and continuous. 


By Theorem 6.2.7, the derivative of In~' is 


(In~*)’(a#) = 1 = In“'(z). 


For all x,y € R, 
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= In7*(In(In7*(x)) — In(In~*(y))) 
=In ‘(x -—y). 


We have proved that for all c € R* and r € Q, In-'(rin(c)) = In“‘(In(c")) = ec", 
and we have proved that for all r € R, In~'(r In(c)) is well-defined. This allows us to define 


exponentiation with real (not just rational) exponents: 


Definition 7.6.3. Let ce R* andr eR. Set 
c" =In'(rIn(o)). 


This immediately gives rise to two functions: 
(1) The generalized power function with exponent r when c varies and r is con- 
stant; 
(2) The exponential function with base c when r varies and c is constant. 
(3) We refer to the exponential function In~* with base c = In~'(1) as the exponen- 
tial function (so the base is implicit). 


Theorem 7.6.4. Let r € R. The function f : Ré > R* given by f(x) = 2” is dif- 
ferentiable, with f’(x) = rx™~'. This function is increasing if r > 0 and decreasing if 
r<0. 


Proof. By definition, f(a) = In~'(rIn(x)), which is differentiable by the chain and scalar 
rules and the fact that In and its inverse are differentiable. Furthermore, the derivative is 
f'(e) =m V(rin(z))- 2 = ee = rin '(rin(z) — In(x)) = rIn“!((r — 1) In(z)) = 
raz”—', The monotone properties then follow from Theorem 6.3.5. 


Theorem 7.6.5. Let c € R‘. The function f : R > R* given by f(x) = c® is dif- 
ferentiable, and f'(a) = (In(c))c*. This function is increasing if c > 1 and decreasing if 
c € (0,1). 


Proof. By definition, f(a) = In~'(2In(c)), which is differentiable by the chain and scalar 
rules and the fact that In~' is differentiable. Furthermore, the derivative is f/(x) = 
In~*(aIn(c)) - n(c) = f(x) - n(c) = (In(c))c?. The monotone properties then follow from 
Theorem 6.3.5. 


We next give a more concrete form to the exponential function In~*. 
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Definition 7.6.6. Let e = In~'(1) (so that In(e) = 1). The constant e is called Euler’s 


constant. 


Since In(e) = 1 > 0 = In(1), by the increasing property of In it follows that e > 1. 

Now let f(x) = 1/x. This function is non-negative on [l,oo). If P is a partition 
of [1,3] into 5 equal parts, then L(f,P) = 0.976934, if P is a partition of [1,3] into 6 
equal parts, then L(f, P) & 0.995635, and if P is a partition of [1,3] into 7 equal parts, 
then L(f,P) = 1.00937. This proves that L(f) > 1 over the interval [1,3]. On [1,3], the 
function f is continuous and thus integrable, so that In3 = ie f >1=JIme. Since In is an 
increasing function, this means that e < 3. By geometry In(2) < U(f, {1,2}) = 1 = Ine, 
so that similarly e > 2. We conclude that e is a number strictly between 2 and 3. 

Note that U(f, {1, 1.25, 1.5, 1.75, 2, 2.25, 2.5}) = 0.25(1+74.+ 4+ p++) = 
eon <1, so that n2.5 = fie f is strictly smaller than this upper sum. It follows that 
In 2.5 <1=Ine and 2.5 < e. If Pisa partition of [1, 2.71828] into a million pieces of equal 
length, a computer gives that U(f, P) is just barely smaller than 1, so that 2.71828 < e. 
If P is a partition of [1, 2.718285] into a million pieces of equal length, then L(f, P) is just 
barely bigger than 1, so that e < 2.718285. Thus e = 2.71828. 

A reader may want to run further computer calculations for greater precision. A dif- 


ferent and perhaps easier computation is in Exercise 7.6.14. 


Theorem 7.6.7. (The exponential function.) For all x € R, In '(x) = e*. 


Proof. By the definitions, 


e* = In "(x -In(e)) = n+ (a: In(In71(1))) = In“*(2- 1) = In" (za). 


We have already proved on page 267 that the derivative of In~! is In7!: 


Theorem 7.6.8. For all x € R, (e”)/ =e”. 


Exercises for Section 7.6 


7.6.1. Prove that for all x > 0, 1/(1+2) < In(1+2)-—Inz < 1/2. (Hint: Use the geometry 
of the definition of In.) 
7.6.2. Let ce Rt. 
i) Prove that for all 2 € R, c® = elm") = er PO), 
ii) Prove that if c #1, then f. ode = mac” —c%), 
7.6.3. Use integration by substitution (Exercise 7.4.5) to compute [ gqa ae. 


7.6.4. Let c€ R™. Use integration by substitution (Exercise 7.4.5) to compute A ac’®”) dr. 
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7.6.5. Let c€ R*. Use integration by parts (Exercise 7.4.6) to compute | wera, 
7.6.6. Use integration by parts (Exercise 7.4.6) to compute f In(x) dz. 


7.6.7. Prove by integration by parts the following improper integral value for a non- 

negative integer n: 

1 
1 
2n 

Se ie =e 

/ ( ) (2n + 1)2 
7.6.8. (Logarithmic differentiation) Sometimes it is hard or even impossible to com- 


pute the derivative of a function. Try for example f(x) = 2%, f(x) = (x? + 2)°*4, or 

f(z) == x? (a—1)3(a+4)3Ve241 
Y%xt+2(x+7)3(a—2)4 

positive real numbers: apply In of both sides, take derivatives of both sides, and solve for 


f'(z). For example, if f(x) = x”, then In(f(«)) = In(a#”) = x In(z), so that 
f(a) 
f(x) 
so that f’(x) = x*(In(z) + 1). 
i) Compute and justify the derivative of f(x) = (x? +2)" +4. 
a? (1)? etd) a EL 
Wat2(2+7)3(x—2)4 * 
7.6.9. This exercise is about applying various versions of the L’H6pital’s rule. 
i) Prove that lim “* =0. 


w—->0O 


. There is another way if the range of the function consists of 


= (In(f(2)))’ = (en(a))’ = Ine) + = = In(z) +1, 


ii) Compute and justify the derivative of f(x) = 


ii) Prove that lim xlnxz = 0. (Hint: Use one of eng = Tine OF tina = Be, 
xz—>0t nee /« 


Perhaps one works and the other does not; we can also learn from attempts that 
do not lead to a successful completion.) 
iii) Compute and justify lim In(2”). 
xz—0t 


iv) Compute and justify lim 2”. 
a2—0t 


1 
7.6.10. Prove that lim, —In(1+ <2) = 1. (Hint: L’Hopital’s rule.) 
x—-O0T X 


i) Prove that for any c € R*, lim xin (1 + <) =c. (Hint: Change variables.) 
w~—->0oO x 


ii) Prove that for any c € Rt, lim (1 + <) =e. 
a 


LOO 
7.6.11. Prove that f(x) = (1++4)* isa strictly increasing function. (Hint: use logarithmic 
differentiation and Exercise 7.6.1.) 


7.6.12. Prove that for all x € R, (1+ 4)” <e. (Hint: previous two exercises.) 


7.6.13. Prove that lim,-49 gah = 1. You may want to apply L’Hopital’s rule (Theo- 
rem 6.4.2). 
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7.6.14. Let f be the exponential function with f(x) = e® for allz ER. 
i) For any positive integer n, compute the Taylor polynomial T}, +9 for e” of degree n 
centered at 0 (Definition 6.5.2). 
ii) Use Taylor’s remainder theorem (Theorem 6.5.5) to show that for any x € R and 
any € > 0 there exists n € N such that |f(x) — Tn, ,0(a)| < €. 
iii) Use the Taylor’s remainder theorem (Theorem 6.5.5) to prove that |e — Tx, po(1)| = 
let — Tg ¢.0(1)| < 0.00001. 
iv) Compute Tg 70(1) to 7 significant digits. 
v) (Unusual) We computed some digits of e on page 269. On a computer, try to get 
more digits of e with those methods and separately with methods in this exercise. 
Which method is faster? Can you streamline either method? 


7.6.15. Let f(x) =e”. Use L’Hopital’s rule (Theorem 6.4.2) and induction on n to prove 
that 


im © lefol@) 

x0 gurl (n+ 1)! 
. ee” if, # 0; 
7.6.16. Let f :.R—R be given by f(x) = a : 
0, ie =, 


i) Prove by induction on n > 0 that for each n there exists a polynomial function 
hyn: R— R such that 
f(x) = stele) ae if x #0; 
0, hee Ae 
(At non-zero x you can use the chain rule, the derivative of the exponential function, 
and the power rule for derivatives. However, f("+")(0) is (but of course) computed 
as limp_so (a LO and then you have to use L’Hopital’s rule. You do not 
have to be explicit about the polynomial functions h,,.) 
ii) Compute the nth Taylor polynomial for f centered at 0. 
7.6.17. (A friendly competition between e and 7) Use calculus and not a calculator to 


determine which number is bigger, e” or 7©. You may assume that 1 < e < a. (Hint: 


compute the derivative of some function.) 
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7.7 Applications of integration 


The Fundamental theorems of calculus relate integration with differentiation. In 
particular, to compute fe f, if we know an antiderivative g of f, then the integral is easy. 
However, as already mentioned after the Fundamental theorem of calculus I, many functions 
do not have a “closed-form” antiderivative. One can still compute definite integrals up to 
a desired precision, however: we take finer and finer partitions of [a,b], and when U(f, P) 
and L(f, P) are within a specified distance from each other, we know that the true integral 
is somewhere in between, and hence up to the specified precision either L(f, P) or U(f, P) 
stands for es f. In applications, such as in science and engineering, many integrals have to 
be and are computed in this way because of the lack of closed-form antiderivatives. 

In this section we look at many applications that exploit the original definition of 
integrals via sums over finer and finer partitions. For many concrete examples we can then 
solve the integral via antiderivatives, but for many we have to make do with numerical 


approximation. 


7.7.1 Length of a curve 


Let f : [a,b] — R be a continuous function. If the graph of f is a line, then by 
the Pythagorean theorem the length of the curve is \/(b— a)? + (f(b) — f(a))?. For a 
general curve it is harder to determine its length from (a, f(a)) to (b, f(b)). But we can do 


the standard calculus trick: let P = {x%0,71,...,%,} be a partition of [a,b]; on each sub- 
interval [x,_1, 2%] we “approximate” the curve with the line (a,_1, f(x,_1)) to (xg, f(@x)), 
compute the length of that line as \/(x~ — n-1)? + (f (ae) — f(@p—1))2, and sum up all 
the lengths: 


Sy Vee tea Ge) fea): 
k=1 


Whether this is an approximation of the true length depends on the partition, but geomet- 
rically it makes sense that the true length of the curve equals 


lim 5° J (ae — te-1)2 + (f (ee) — f(ae-1))?, 
k=1 


as the partitions {xvo,71,...,%,} get finer and finer. But this is not yet in form of The- 
orem 7.1.16. For that we need to furthermore assume that f is differentiable on (a,b). 
Then by the Mean value theorem (Theorem 6.3.4) for each k = 1,...,n there exists 
Ck € (@~-1,2%) such that f(a,) — f(ae-1) = Ff’ (ce)(@e — Ze-1). If in addition we as- 
sume that f’ is continuous, then it is integrable by Theorem 7.3.1, and by Theorem 7.1.16 
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the true length of the curve equals 


lim V (tn — &e—1)? + (f' (ce) (@e — Be-1))? 


P={2x0,%1,..., an }finer,c, €[z~-1,0k] ra 


= lim So 1+ (F'(cx))? (ee — te-1) 


P={x0,21 gisele Xn }finer,c, €[r@~-1,0%] k=l 


We just proved: 


Theorem 7.7.1. If f : [a,b] > R is continuous and differentiable such that f’ is continuous 
on [a,b], then the length of the curve from (a, f(a)) to (b, f(b)) is 


b 
/ J1t+ (f'(x))? da. 


7.7.2 Volume of a surface area of revolution, disk method 


Let f : [a,b] — R be continuous. We rotate the region between x = a and x = b and 


bounded by the x-axis and the graph of f around the z-axis. If the graph is a horizontal 
line, then the rotated region is a disk of height b — a and radius f(a), so that its volume 
is (f(a))?(b — a). For a general f we let P = {x0,21,...,2n} be a partition of [a,b]; on 
each sub-interval [x,_1,7,] we “approximate” the curve with the horizontal line y = f (cx) 
for some cy € [Xp-1,~], we compute the volume of the solid of revolution obtained by 
rotating that approximated line over the interval |7,_1, 7%] around the x-axis, and sum up 


all the volumes: . 


> (f (ce)? (en — 24-1)- 


k=1 
Geometrically it makes sense that the true volume equals 


n 


lim m(f(ck))? (ae — te-1); 
PHF 215-5 eee (FC r)) ( e 1) 


and by Theorems 7.1.16 and 7.3.1, this equals 7 LF@)yP? dx. This proves: 


Theorem 7.7.2. If f : [a,b] + R is continuous, then the volume of the solid of revolution 


obtained by rotating around the x-axis the region between x = a and x = b and bounded 
by the x-axis and the graph of f is 


a) “(f(@))% de. 
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7.7.3 Volume of a surface area of revolution, shell method 


Let 0 < a < 6, and let f,g : [a,b] — R be continuous such that for all x € |a, bj 
f(x) < g(x). We rotate the region between y = a and y = b and bounded by the graphs 
of x = f(y) and x = g(y) around the z-axis. If f(z) = c and g(x) = d are constant 
functions, then the solid of revolution is a hollowed disk, with the outer border of height 
d—c and radius b and the hole inside it has the same height but radius a. Thus the volume 
is 7(d — c)(b? — a). For general f and g we let P = {yo,41,---,Yn} be a partition of 
[a,b]; on each sub-interval [y,—1, yx] we “approximate” the curve f,g with the horizontal 
line f (cr), 9(ce) for some cr € [Yr—1, Ye], We compute the volume of the solid of revolution 
obtained by rotating that approximated region over the interval [y,—1, yx] around the x- 
axis, and sum up all the volumes: 


n 


Y= (g(x) — fee) (YR — YR-1): 


k=1 
Geometrically it makes sense that the true volume equals 


n 


lim tT(g(c.) — fle aeee yaad 
See eet = (9(cr) — f(Ck)) (YE — YE—1) 


n 


= lim ™(G(ck) — FC) )(Ye+Ye—1)(Ye-Ye—1); 
P={yoYry--5 Yn ffiner,cr €[yxr—1;Yk] k=l 


and by Theorems 7.1.16 and 7.3.1, this equals 7 La) — f(y))2y dy. This proves: 


Theorem 7.7.3. If0 <a< band f,g: [a,b] + R are continuous, then the volume of the 
solid of revolution obtained by rotating around the x-axis the region between y = a and 
y = b and bounded by the graphs of x = f(y) and x = g(y), is 


b 
om . y(a(y) — F(y)) dy. 


Example 7.7.4. The volume of the sphere of radius r is amr, 


Proof. We rotate the upper-half circle of radius r centered at the origin around the z- 
axis. The circle of radius r centered at the origin consists of all points (x,y) such that 


xg? + y? =r?, so we have g(y) = ./r2 — y? and f(y) = —\/r? — y2. Thus the volume is 
: 4 > ow 4 
anf yvV/r? — y? dy = (-5a0 ~ vy") 3 
0 


; = up a = 30" ‘ 
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7.7.4 Surface area of the surface area of revolution 


It takes quite a few steps to analyze the simple case before we get to an integral. 
We first rotate the line segment y = max from x = 0 to x = b > 0 around the z-axis. 
We assume for now that m 4 0. 


In this way we obtain a right circular cone of height 6 and base radius |m|b. The 
perimeter of that base circle is of course 27|m|b. If we cut the cone in a straight line from 


a side to the vertex, we cut along an edge of length ,/b? + (mb)?, and we get the wedge as 
follows: 


perimeter 27|m|b 


radius 
b? + (mb)? 


Without the clip in the disc, the perimeter would be 27 ,/b? + (mb)?, but our perime- 
ter is only 27|m|b. Thus the angle subtended by the wedge is by proportionality equal to 


2n|m|b —— 2n|m| 
Qny/b?-+(mb)? Vi+m?" 
the full angle, and so proportionally the area of our wedge is radius squared times one half 


The area of the full circle is radius squared times one half of 


2_ 2n|m| 
2V1+m?2 
= n|m|b?V1+m?. Note that even if b < 0, the surface area is the absolute value of 


mmb?V/1 + m2. 

Thus if m is not zero and 0 <a < bora <b<0, then the surface area of revolution 
obtained by rotating the line y = mx from x = a to x = b equals the absolute value of 
nmm(b? — a?)V/1+ m2. Note the geometric requirement that at a and b the line is on the 
same side of the x-axis. 


of our angle, i.e., the surface area of this surface of revolution is (./b? + (mb)?) 
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Now suppose that we rotate the line y = ma-+/ around the z-axis, with m 4 0 
and a < 6 and both on the same side of the intersection of the line with the z-axis. This 
intersection is at « = —l/m. By shifting the graph by //m to the right, this is the same as 
rotating the line y = mz from x = a+1/m to x =b+1/m, and by the previous case the 
surface area of this is the absolute value of 


am((b + 1/m)? = (a + 1/m)?) V1 + m2 
= mm (b? + 2bl/m + 1?/m? — (a? + 2al/m + I? /m?)) V/1 + m2 
= mm (b? — a? + 2(b—a)l/m) Jl +m2 

mm(b — a)(b +a + 2l/m)V1 + m2 

m(b—a)(m(b + a) + 21)\/1+ m?. 


If instead we rotate the line y = | (with m = 0) around the z-axis, we get a ring whose 
surface area is (b — a)2z|1|, which is the absolute value of 1(b— a)(m(b+ a) + 21)V//1 + m?. 
Thus for all m, the surface area of the surface of revolution obtained by rotating the line 
y= max-+l from « =a to x = b around the z-axis is the absolute value of 


m(b—a)(m(b+ a) + 21) /1+m?, 


with the further restriction in case m #¢ 0 that a and 6b are both on the same side of the 
x-intercept. 

Now let f : [a,b] + Rso be a differentiable (and not necessarily linear) function. Let 
P= {x0,%1,...,%n} be a partition of [a, b]; on each sub-interval [x,_1, x,] we approximate 
the curve with the line from (a,_1, f(p_-1)) to (xx, f(a~)). By the assumption that f 
takes on only non-negative values we have that both 7,_1, x, are both on the same side of 
the z-intercept of that line. The equation of the line is y = f@#)=fltx-v) (cx — ay) + f(x), 


Tk-Tk-1 


so that m — £@0=f@r-) ang 1 — —f@e=f@r-) f(ax). Since f is differentiable, by 


Lk-Lk-1 Lk-Lk-1 


the Mean value theorem (Theorem 6.3.4) there exists cy, € (%p_1,%~) such that f’(c,) = 
flee)“ Fler) Thus m = f’(cy) and 1 = f(x) — f’ (ce) xk. 


L_-LE-1 
We rotate that line segment around the x-axis, compute its volume of that solid of 


revolution, and add up the volumes for all the subparts: 


Ye fw —en1) (fF (ee) (wnt an—1)+2 (F (we) —F"(cx)ee)) V+ Few) 
k=1 


= Soa |(f'(ce)(—tx + te-1) + 2f (ee) V1 + CF (ce))? (ae — @e-1)- 
k=1 


By Theorems 7.1.16 and 7.3.1 we get the following: 
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Theorem 7.7.5. If f : [a,b] + Rso is differentiable with continuous derivative, then the 
surface area of the solid of revolution obtained by rotating around the x-axis the curve 
y = f(x) between x = a and x = 0 is 


b b 
m ie If («)(—a-+a)+2f(2)| VIF Ce)? de = 2m / f(a) JT wy de. 


Exercises for Section 7.7 


7.7.1. Use the methods from this section to compute the following: 

i) The perimeter of the circle. 

ii) The volume of the sphere of radius r. 

iii) The volume of the ellipsoid whose boundary satisfies = a ye ae a = 1. (We need 
multi-variable calculus to be able to compute the volume of the ellipsoid whose 
boundary satisfies is + ye of a =ls) 

iv) The volume of a conical pyramid with base radius r and height h. 

v) The volume of a doughnut (you specify its dimensions). 

vi) The surface area of the sphere of radius r. 

7.7.2. The moment of inertia of a tiny particle of mass m rotating around a circle of radius 
ris I =mr?. Likewise, the moment of inertia of a (circular) hoop of radius r and of mass 
m rotating around its center is J = mr?. The goal of this exercise is to derive the moment 
of inertia of a thin circular plate of radius 5a meters and mass 4 kilograms rotating about 
its diameter. (Let 6 be the thickness of the plate and p the mass divided by the volume.) 

i) Center the plate at the origin. Let the axis of rotation be the y-axis. 

ii) Let P be a partition of [0,5]. Prove that the moment of inertia of the sliver of 
the plate between x,_; and xz is approximately Ss / x3, — pee where cy € 
eats eel 

iii) Prove that the moment of inertia of the rotating circular plate is 50 kilogram meters 
squared. 

7.7.3. Ifaconstant force F' moves an object by d units, the work done is W = Fd. Suppose 
that F depends on the position as F(x) = ka for some constant k. Find the total work 
done between x = a and x = b. 


7.7.4. In hydrostatics, (constant) force equals (constant) pressure times (constant) area, 
and (constant) pressure equals the weight density of the water w times (constant) depth 
h below the surface. But most of the time we do not have tiny particles but large objects 
where depth, pressure, and surface areas vary. For example, an object is completely sub- 
merged under water from depth a to depth 6. At depth h, the cross section area of the 
object is A(h). Compute the total force exerted on the object by the water. 


Chapter 8: Sequences 


In this chapter, Sections 8.5 and 8.4 contain identical results in identical order, but 
the proofs are different. You may want to learn both perspectives, or you may choose to 
omit one of the two sections. 


8.1 Introduction to sequences 


Definition 8.1.1. An infinite sequence is a function with domain N* and codomain C. 
If s is such a function, instead of writing s(n), it is common to write s,, and the sequence 
s is commonly expressed also in all of the following notations (and obviously many more): 


s= {$1, 52, 53,-- + = {Sntn=1 me {Sn}n>1 = {$n }nent 
= {sn}n = irae ee poe = 1A tes = {Sn}. 


The nth element s, = s(n) in the ordered list is called the nth term of the sequence. 


The notation {s1, 52,53,...} usually stands for the set consisting of the elements 
$1, 82, 83,..., and the order of a listing of elements in a set is irrelevant. Here, however, 
{$1, 2, 53,--.} stands for the sequence, and the order matters. When the usage is not 
clear from the context, we add the word “sequence” or “set” as appropriate. 

The first term of the sequence {2n — 1},,>4 is 7, the second term is 9, et cetera. The 
point is that even though the notating of a sequence can start with an arbitrary integer, 
the counting of the terms always starts with 1. 

Note that s, is the nth term of the sequence s, whereas {s,} = {5n}n>1 is the 
sequence in which n plays a dummy variable. Thus 


a ee 


Examples and notation 8.1.2. 
(1) The terms of a sequence need not be distinct. For any complex number c, {c} = 
{c,¢,¢,...} is called a constant sequence. 
(2) For any complex numbers c and d, the sequence {c, d,c,d,c,d,...} can be written 
more concisely as {45£(—1)" + £7},,. 
(3) The sequence {(—1)"} = {—1,1,—1,1,...} has an infinite number of terms, and 
its range is the finite set {—1,1}. The range of the sequence {i”} is the set 
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(4) 


(5) 


(10) 
(11) 


(12) 
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The sequence s with s(n) = n+ 4 for all n > 1 can be written as {n+ 4} = 
{n+ 4}n>1 = {5,6,7,...} = {n}n>5. This sequence is different from the sequence 
ey = tl 2 Be kd ets 

The sequence {2n} is the sequence {2, 4, 6, 8,...}. It is not the same as the 
sequence {4,2,8,6,12,10,...}, namely, if we scramble the order of the terms, we 
change the sequence. 

The sequence of all odd positive integers (in the natural order) can be written as 
{2n — 1} = {2n — 1}ns1 = {2n + 1}nso (and in many other ways). 

Terms of a sequence can be defined recursively. For example, let s; = 1, and for 
each n > 1 let 5,41 = 2s,. Then the sequence of these s,, has a non-recursive form 
Sn = 2"—!. If however, s; = 1 and for each n > 1, Sar = 283 +1/sp + 24+1n(s,)+1, 
then we can certainly compute any one s, by invoking s, for k < n, but we do 
not get a closed-form for s,, as in the previous example. 

(Fibonacci numbers) Let s; = 1, s2 = 1, and for all n > 2, let 5,41 = 8)+5n-1. 
This sequence starts with 1,1, 2,3, 5,8, 13,21, 34,..., and obviously keeps growing. 
(For more on these numbers, see Exercise 1.6.30, where in particular it is proved 
that for all integers n, sy) = a (444)" 77 (4)") 


Some sequences, just like functions, do not have an algebraic expression for terms. 


For example, let s be the sequence whose nth term is the nth prime number. 
This sequence starts with 2, 3,5, 7,11, 13,17, 19,23, and we could write many more 
terms out explicitly, but we do not have a formula for them. (This s is indeed an 
infinite sequence since there are infinitely many primes, as proved on page 33.) 
Note that {n}nez is NOT a sequence because the list has no first term. 
We can scramble the set Z of all integers into a sequence, for example as follows: 
{0, 1, —1, 2, -2,3, —3, 4, —4,...}, which algebraically equals 

oes ees if n is even; 

. —(n—1)/2, otherwise. 


(“Otherwise” applies to odd (positive) integers.) 

One can scramble the set Q* of all positive rational numbers into a sequence via 
a diagonal construction as follows. First of all, each positive rational number can 
be written in the form a/b for some positive integers a,b. Rather than plotting 
the fraction a/b, we plot the point (a, b) in the plane. Refer to Plot 8.1.2: the bold 
points are elements of Nt x NT, and each such (a, b) is identified with the fraction 
a/b. Every positive rational number appears in this way somewhere as a bold point, 
and all appear multiple times because ¢ = $£. Now we want to systematically 
enumerate these bold points/rational numbers. If we first enumerate all of them 


in the first row, and then proceed to the second row, well, actually, we never get 
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to the second row as we never finish the first row. So we need a cleverer way of 
counting, and that is done as follows. We start counting at (1,1), which stands 
for 1/1 = 1. We then proceed through all the other integer points in the positive 
quadrant of the plane via diagonals as in Plot 8.1.2. The given instructions would 
enumerate positive rational numbers as 1/1,2/1,1/2,1/3,2/2. Ah, but 2/2 has 
already been counted as 1/1, so we do not count 2/2. Thus, the proper counting 
of positive rational numbers in this scheme starts with: 


1/1,2/1, 1/2, 1/3, 2/2, 3/1, 4/1, 3/2, 2/3, 1/4, 1/5, 
2/4, 3/3, 4/2, 5/1,6/1,5/2, 4/3, 3/4, 2/5, 1/6, 


et cetera, where the crossed out numbers are not part of the sequence because they 
had been counted earlier. Thus in this count the fifth term is 3. 


1 2 3 4 5 6 


Plot 8.1.2 Counting of the positive rational numbers. 


It is important to note that every positive rational number appears on this list, 
and because we are skipping any repetitions, it follows that every positive rational 
number appears on this list exactly once. Thus this gives an enumeration of 


positive rational numbers. * 


A different enumeration of Q* is given with an algebraic formulation in Exer- 
cise 2.4.28. 


(13) If {s,,} is a sequence of positive rational numbers in which every positive rational 


number appears exactly (or at least) once, we can construct from it a sequence in 


* Here is a fun exercise: look at the ordered list of positive rational numbers above, including the crossed- 


(n+m—2)2 , 38n+m—4 
2 2 


out fractions. Verify for a few of them that n/m is in position { 


2 
a fact that f(x,y) = seer) 3etyo4 + 1 gives a bijection of (N+)? with Nt. This was first proved by Rudolf 
Fueter and George Polya, but the proof is surprisingly hard, using transcendence of e” for algebraic numbers r, 
so do not attempt to prove this without more number theory background. 


+1 on the list. Namely, it is 
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(14) 


(15) 
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which every rational number appears exactly (or at least) once as follows: 
0, tea: 
Gi * Sips if n is even; 
S(n—1)/2, ifn > 3 is odd. 
This sequence starts with 0, 51, —81, 52, —S2, $3, —S3,.... Since every positive ratio- 
nal number is one of the s,, (exactly once/at least once), so every rational number 
is on this new list (exactly once/at least once). 


Incidentally, it is impossible to scramble R or C into a sequence. This can be proved 


with a so-called Cantor’s diagonal argument, which we are not presenting here, 
but an interested reader can consult other sources. 


Sequences are functions, and if all terms of the sequence are real numbers, we can 
plot sequences in the usual manner for plotting functions. The following is part of 
a plot of the sequence {1/n}. 

Sn 
1 : 


e 
e 
e * rs 


12 3 4 5 6 7 8 9 10 11 12 138 14 15 
Plot 8.1.2 s, =1/n 


Another way to plot a sequence is to simply plot and label each s,, in the complex 
plane or on the real line. We plot three examples below. 
$1,$3,$5,--- $2,84,56,..- $3 S82 $1 

—1 1 0 1 


Plot 8.1.2 Image sets of {(—1)"}, {1/n}, {(-1)"/n + 0.5i/n}. 


There is an obvious arithmetic on sequences (just like there is on functions): 


{Sn} + {tn} = {8n £ tn}, 
{Sn} + {tn} = {8n-tn}, 
c{8n} = {cn}, 
{ant itay =—ASnfin) Gt, #0 forall 1). 


One has to make sure to add/multiply/divide equally numbered terms of the two 
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sequences, such as in the following: 
{n}n>3 + {n}n>2 = {n+ ljnd2 + {M}nz2 = {2n + Inde. 


Here are a few further examples of arithmetic on sequences, with + and - binary 
operations on the set of sequences: 


{2"} + {-2"} = {0}, 

{2™} + {(—2)”} = {0, 8, 0, 32, 0, 128, 0, 512,0,...}, 
2{2"} = {2"*"}, 

{2"} -{2°"} = {1}, 

{2n — 1} + {1} = {2n}, 

{(-1)"n} + {2/n} = {(-1)"n + 2/n}, 
{i"}/{(—t)"} = {(-1)"}. 


Exercises for Section 8.1 

8.1.1. Express algebraically the ordered sequence of all positive integer multiples of 3. 
8.1.2. Think of a sequence whose terms are all between 2 and 3. 

8.1.3. Plot the sequences {n?} and {1/n?}. Compare the plots. 

8.1.4. Plot the sequence {(—1)”}. 


8.1.5. Plot the image set of the sequence s = {7”}: draw the real and imaginary axes and 
the unit circle centered at the origin; on this circle, plot i!,7?,73,i4,7°,i°, and label each 
correspondingly with “s,”, “sg”, “s3”, “s5”, “sg”. Label also s29, $100, $101, $345: 

8.1.6. Prove that the difference of the ordered sequence of all positive odd integers and 


the constant sequence {1} is the ordered sequence of all non-negative even integers. 


8.1.7. We have seen that + and - are binary operations on the set of infinite sequences. 
i) Prove that {0} is the identity for + and that {1} is the identity for -. 
ii) Prove that every infinite sequence has an inverse for +. 
iii) Show that not every infinite sequence has an inverse for this operation. What 
infinite sequences have an inverse for this operation? 
8.1.8. Sequences can also be finite. Two examples of finite sequences are: (i) last exam 


scores in a class arranged in alphabetic order by the student; and (ii) last exam scores in 
a class arranged in ascending order by score. Give two more examples of finite sequences. 
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8.2 Convergence of infinite sequences 


Definition 8.2.1. A sequence s = {s,,} converges to L € C if for every real number 
€ > 0 there exists a positive real number N such that for all integers n > N, |sp — L| < €. 

If s converges to L, we also say that L is the limit of {s,,}. We use the following 
notations for this: 


Sn 2 L, {8} > L, 
lms =, dims,’ =, lim{s, } = 6, lim(s;,)=h, 


Jim, by =, Jim {8nf = L, lim $n = L, lim (Sn) —_ op 


and, to save vertical space, just like for limits of functions, we also use a variation on the 
jase. hreesliitiy 635 8p Sub. lim 4684 Sek J, limy cela) SU. 


We say that a sequence is convergent if it has a limit. 


For example, the constant sequence s = {c} converges to L = c because for all n, 


|s, — L| = |c — c| = 0 is strictly smaller than any positive real number e. 
The sequence s = {300, —5, 7, 4, 0.5, 10°, 2, 2, 2,2,2,...} converges to L = 2 because 
for all n > 7, |s, — L| = |2 — 2| = 0 is strictly smaller than any positive real number e. 


In conceptual terms, a sequence {s,,} converges if the tail end of the sequence gets 
closer and closer to L; you can make all s, with n > N get arbitrarily close to L by simply 
increasing N a sufficient amount. 

We work out examples of epsilon-N proofs; they are similar to the epsilon-delta 
proofs, and we go through them slowly at first. Depending on the point of view of your 
class, the reader may wish to skip the rest of this section for an alternative treatment 
in Section 8.5 in terms of limits of functions. More epsilon-N proofs are in Section 8.4. 
Be aware that this section is more concrete; the next section assumes greater ease with 
abstraction. 


Example 8.2.2. Consider the sequence s = {1/n}. Example 8.1.2 gives a hunch that 
lim s, = 0, and now we prove it. [(RECALL THAT ANY TEXT BETWEEN SQUARE BRACKETS 
IN THIS FONT AND IN RED COLOR IS WHAT SHOULD APPROXIMATELY BE GOING THROUGH 
YOUR THOUGHTS, BUT IT IS NOT SOMETHING TO WRITE DOWN IN A FINAL SOLUTION.) 
BY THE DEFINITION OF CONVERGENCE, WE HAVE TO SHOW THAT FOR ALL € > 0 SOME 
PROPERTY HOLDS. ALL PROOFS OF THIS FORM START WITH:| Let € be an arbitrary 
positive number. [NOW WE HAVE TO SHOW THAT THERE EXISTS AN N FOR WHICH 
SOME OTHER PROPERTY HOLDS. THUS WE HAVE TO CONSTRUCT AN N. USUALLY 
THIS IS DONE IN RETROSPECT, ONE CANNOT SIMPLY GUESS AN NV, BUT IN THE FINAL 
WRITE-UP, READERS SEE SIMPLY THAT EDUCATED GUESS — MORE ABOUT HOW TO GUESS 
EDUCATEDLY LATER:] Set N = +. Then N is a positive real number. [Now WE HAVE TO 
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SHOW THAT FOR ALL INTEGERS n > N, |s, — 0| < €. ALL PROOFS OF STATEMENTS OF 
THE FORM “FOR ALL INTEGERS n > N” START WITH:] Let n be an (arbitrary) integer 
with n > N. [FINALLY, WE HAVE TO PROVE THE INEQUALITY |s,, —0| < €. WE DO THAT 
BY ALGEBRAICALLY MANIPULATING THE LEFT SIDE UNTIL WE GET THE DESIRED FINAL 
<e:| 
[$n — O| = |1/n — | 
= |1/n| 
=1/n (because n is positive) 


<1/N (because n > N > 0) 


1 
= (because N = 1/e) [THAT WAS A CLEVER GUESS!] 


(1/e) 


So we conclude that |s,, — 0| < €, which proves that lim s,, = 0. 

Just as in the epsilon-delta proofs where one has to find a 6, similarly how does one 
divine an N? In the following two examples we indicate this step-by-step, not as a book 
or your final homework solution would have it recorded. 


Example 8.2.3. Let s, = {4((—1)” +i(-1)"*")}. If we write out the first few terms, we 
find that {s,} = {—1+i,1/2—1/2, -1/3+i/3,...}, and we may speculate that lim s, = 0. 
Here is plot of the image set of this sequence in the complex plane: 


We prove that lims, = 0. Let e > 0. Set N = . [WE WILL EVENTUALLY 
FILL IN WHAT THE POSITIVE REAL NUMBER N SHOULD BE, BUT AT THIS POINT OF THE 
PROOF SIMPLY LEAVE SOME BLANK SPACE. ASSUMING THAT WN IS IN PLACE, WE NEXT 
NEED TO PROVE THAT FOR EVERY INTEGER n > N, THE RELEVANT PROPERTY AS IN 
THE €— N DEFINITION OF LIMITS HOLDS. 'THE PROOF OF “FOR EVERY INTEGER n > N” 
ALWAYS STARTS WITH:] Let n be an integer strictly bigger than N. [WE WANT TO MAKE 
SURE EVENTUALLY THAT n IS POSITIVE, WHICH IS GUARANTEED IF N IS POSITIVE, BUT 
WITH BLANK N, WE WILL SIMPLY ASSUME IN THE ALGEBRA BELOW THAT N IS POSITIVE. |] 
Then 


Jen — 0] =|=((-1)" + 4(-1)"*¥) - 0 
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1 
Saleh |(—1)” + 4(—1)"*"| (because |ab| = |a||b]) 
1 
= |(—1)" + i(—1)"*"| (because n is positive) 
1 
= rae 
1 
n 
1 
a V2 (because n > N) 


[ASIDE: WE WANT/NEED V2/N < €, AND V2/N =€ IS A POSSIBILITY, SO SET N = V2/e. 
Now GO AHEAD, WRITE THAT MISSING INFORMATION ON WN IN LINE 1 OF THIS PROOF!] 
V2 
V2/e 


= €; 


(because N = V/2/e) 


which proves that for all n > 1/e, |s, —0| < ¢. Since € is arbitrary, this proves that 


lim s,, = 0. 


Thus a polished version of the example just worked out looks like this: 
We prove that lim {+((—1)” + i(-1)"*1) — 0} =0. Let e > 0. Set N = V2/e. Then 
N is a positive real number. Let n be an integer strictly bigger than N. Then 


ree Gs $i(-1)"*4) —0 


slr 3s 


| - |(—1)” + o(=1)"""| (because |ab| = |a]|b]) 


-|(—1)" + i(—1)"*"| (because n is positive) 


Ge (ea. 


SlrRsleslresle 


V2 
< —- V2 (because n > N) 


a) 
= Te, 


=€, 


(because N = V/2/e) 


which proves that for all n > V2/e, |s, —0| < €. Since € is arbitrary, this proves that 


lim s, = 0. 
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Example 8.2.4. Claim: lim anton = 3. Proof: Let € > 0. Set N = . Let n be 


an integer strictly bigger than N. Then 
2n + 3n? Q2n+3n? 3(34+4n+n?) 
34+4n4+ n? 3+4n4+ n? 3+4n4+ n? 
—9—10n 
3+4n+n? 
9+ 10 
eM ET — i (because n > 0) 
[ASSUMING THAT N > 0.] 


n+ 10n 
~ 3+4n+n? 
[ASSUMING THAT N > 8.] 


(because n > 9) 


lin 
3+4n+n? 


11 
< — (because 3+ 4n +n? > n?, 
n 


so 1/(3 + 4n + n?) < 1/n?) 
11 


n 

11 

NW (because n > N) 
1 


IA 


a: 
ile (because N > 11/e so 1/N < 1/(11/e)) 
[ASSUMING THIS. | 
=> €, 


which was desired. Now (on scratch paper) we gather all the information we used about N: 
N>0, N > 8, N > 11/e, and that is it. Thus on the first line we fill in the blank part: 
Set N = max{8,11/e}, which says that N is either 8 or 11/e, whichever is greater, so that 
N>8and N > 11/e. 


The polished version of this proof would go as follows: 


Example 8.2.5. Claim: lim 2n+3n* — 3. Proof: Let e > 0. Set N= max{8, 11/e}. Then 


3+4n4+n2 
N is a positive real number. Let a be an integer strictly bigger than N. Then 
2n + 3n? _| 2n+3n? 3(3 + 4n + n?) 
3+4n+n? =|. 3+4n+4+n? 
=| == 100 
7 | 3+4n+n? 
9+ 10n 


= carer park) (because n> 0) 
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n+10n 
~ 3+4n+ n? 
lin 


34+4n+ n? 


11 
< aa (because 3+ 4n +n? > n?, 
n 


(because n > 9) 


so 1/(3 + 4n + n?) < 1/n?) 
11 


n 
1 
NW (because n > N) 
11 
ii/e (because N > 11/e so 1/N < 1/(11/e)) 
€ 
E, 


IA 


which proves that for alln > N, 
of this sequence is 3. 


Sn —3] < . Since € is arbitrary, this proves that the limit 


Below is a polished proof of a very similar problem. 


Example 8.2.6. Claim: lim =2n+3n" _ 3. Proof: Let > 0. Set N = max{2,20/e}. Then 


9—An+n? 
N is a positive real number. Let n be an integer strictly bigger than N. Then 
—2n + 3n? _ | -2n+3n? 3(9-—4n +n’) 
9—4n+n? ~ 19—4n +n? 9—4n+n? 
_ | -274+10n 
9—4n+n? 
—27+ 10 
eee newe (because n > N > 2,son>3, 
9-—4n+n? 
so 10n—27 > 0 and n?—4n+9 = (n—3)7+2n>0) 
10 
< Sin ee jae 72 (because 10n — 27 < 10n) 
2 10n 
9 — 4n +4 (1/V2)n? + (1 — 1/V2)n? 
10n 
< i 
a —1/Van2 


(because 0 < (n — 4)? + 2 = 2(9 — 4n + (1/V/2)n?)) 


10 
ies (because 1 — 1/./2 > 0.5, so =e < gs) 


<7 (because n > N) 
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20 
=< 30/e (because N > 20/e so 1/N < 1/(20/e)) 
€ 
=€ 


which proves that for alln > N, |s, —3| < €. Since € is arbitrary, this proves that the limit 


of this sequence is 3. 


Example 8.2.7. Let r € C such that |r| < 1. Then lim, ..7r” = 0. 


Proof. If r = 0, the sequence {r”} is the constant zero sequence so certainly the limit is 0. 


So we may assume without loss of generality that r 4 0. Let e > 0. Set N = 
In(min{e,0.5}) 
In |r| 
number. Let n be an integer with n > N. Then 


. Since In of numbers between 0 and 1 is negative, we have that N is a positive 


nO] =f" 
= |r| [nl 
<1" |r| (by Theorem 7.6.5) 
= min{e, 0.5} 
<e€. 


Example 8.2.8. lim { (1 + 1)"\ =e. 


Proof. All the hard work for this has been done already in Exercise 7.6.10. Let « > 0. By 
Exercise 7.6.10, limyz 6 {(1 + 1)\"} =e. Thus there exists N > 0 such that for all x > N, 
{(1+4)"}-e]<e. 


|{ (2 + 4)"\ — e| <e. In particular, for any integer n > N, 


Example 8.2.9. limn!/” = 1. 


Proof. For all n > 2, by Exercise 1.7.7, 


n= (nV) = (n/™ 141)" = 3 & (ni/> — 1). 


k=0 
Each of the summands is non-negative, and if we only use the summands with k = 0 and 
k = 2, we then get that 
n>1t+ s(n Se =1)7. 

By subtracting 1 we get that n—1 > $n(n—1)(n!/"—1)?, so that for n > 2, 2 S (nl s1)2, 
and hence that “2 Siva, Certainly ni/” —1 > 0 for all n > 1. It follows that for all 
n > 2, and even for alln >1,0<n!/"-1< vA. Now let ¢ > 0. Set N = max{2, 2/e7}. 
Then JN is a positive real number. Let n > N be an integer. Then 0 < ni/n_p< vA < 


2 = €, which proves that |n!/” 


—1| <e, and hence proves this limit. 
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Example 8.2.10. Let M be a positive real number. Then lim M‘/” = 1. 


Proof. First suppose that M > 1. Then certainly for all integers n > M, we have that 
1< M'¥/" < n'/"_ Let « > 0. By the previous example, there exists N > 0 such 
that for all integers n > N, 0 < n'/" —1 < . Then for all integers n > max{M, N}, 
0< MV" —1<n/"_1 <e, which implies that |M1/” — 1| < €. This proves the example 
in case M > 1. 

Now suppose that M < 1. By assumption, 1/M > 1, so by the previous case 
(and by using that (1/M)!/"” = 1/M‘/"), there exists N > 0 such that for all integers 
n> NOS 1/Mi/n —1<e. By adding 1 to all three parts in this inequality we get that 
1<1/M 1/n < €+41, so that by compatibility of < with multiplication by positive numbers, 


=a < M'/" <1. Hence by compatibility of < with addition, 
1 € 


= < 
eset ese 
since e+ 1> 1. Thus |1 — M‘/"| < €, which proves that lim M1/” = 1. 


C211 re 


€, 


Exercises for Section 8.2 


8.2.1. Let s, =1/n?. Fill in the blanks of the following proof that lim{s,} = 0. 


Let € > 0. Set N = . Then ifn>N, 
1 
a= 2-0 

oh 

= (because ) 
1 

< qa (because ) 

= (because ) 


= €. 


8.2.2. Prove that for any real number L, lim{+ + L} = L. 

8.2.3. Prove that for any positive real number k, lim{1/n’} = 0. 
8.2.4. Let f : NC be a bounded function. Prove that lim fn) ="). 
8.2.5. Prove that lim ante =(); 


8.2.6. Prove that lim Sut — 3. 


8.2.7. Prove that lim {/n +1-— /n} =0. 
8.2.8. Prove that for every positive integer k, lim {Vn +k- vn} al) 
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8.2.9. Suppose that the terms of a sequence are given by s, = )+)_, CESWE 


i) Using induction on n, prove that )77_, KEED =] ae 


ii) Use this to find and prove the limit of {s,,}. 


8.2.10. Prove the following limits: 
i) lim naa = 1. (Hint: epsilon-N proof, continuity of In.) 


In(ln(n)) _ 
In(Iin(n+1)) ~~ 1. 


8.2.11. Let r € R. By Theorem 3.8.8, for every positive integer n there exists a rational 
number s, € (r — +,r ++). Prove that {s,,} converges to r. 


ii) lim 


8.2.12. Let r be a real number with a known decimal expansion. Let s, be the rational 
number whose digits n+ 1,n + 2,n+3, et cetera, beyond the decimal point are all 0, and 
all other digits agree with the digits of r. (For example, if r = 7, then s; = 3.1, so = 3.14, 
87 = 3.1415926, et cetera.) Prove that lim{s,,} =r. (Repeat with binary expansions if you 


know what a binary expansion is.) 
8.2.13. Prove that lim, { ‘Vn! } =1. 
8.2.14. What is wrong with the following “proof” that lim,_.. oat = 5. 


“Proof.” Let € > 0. Set N = 2+. Let n> N. Then 
n 1 2n — (2n +1) 

2n+1 2 2(2n + 1) 

—1 
2(2n +1) 

1 
2(2n + 1) 

1 
< SN 1) (ON +1) (because all terms are positive) 


= €. 


8.3 Divergence of infinite sequences and infinite limits 


The sequence {(—1)”} alternates in value between —1 and 1, and does not seem to 


converge to a single number. The following definition addresses this situation. 
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Definition 8.3.1. A sequence diverges if it does not converge. In other words, {s,} 
diverges if for all complex numbers L, lim{s,} 4 L. 


By the usual negation of statements (see chart on page 32), lim{s,,} #4 ZL means: 


not (For all real numbers € > 0 there exists a positive real number 
N such that for all integers n > N, |sn — L| < €.) 


= There exists a real number € > O such that 
not ( there exists a positive real number N such that for all 
integers n > N, |s, — L| <«.) 


= There exists a real number ¢€ > 0 such that for all positive real 
numbers N, not ( for all integers n > N, |s, — L| < €.) 


There exists a real number € > 0 such that for all positive real 
numbers N there exists an integer n > N such that not ( 
|Sn — L| < €.) 


There exists a real number € > 0 such that for all positive 
real numbers N there exists an integer n > N such that 
ls, —L| >. 


Example 8.3.2. {(—1)"} is divergent. Namely, for all complex numbers JL, lim s,, 4 L. 


Proof. Set € = 1 (half the distance between the two values of the sequence). Let N be 
an arbitrary positive number. If Re(Z) > 0, let n be an odd integer greater than N, 
and if Re(L) < 0, let n be an even integer greater than N. In either case, |s, — L| > 
|Re(s,) — Re(L)| > 1 =e. 


The sequence in the previous example has no limit, whereas the sequence in the next 


example has no finite limit: 
Example 8.3.3. For all complex numbers L, lim{n} 4 L. 


Proof. Set € = 53 (any positive number works). Let N be a positive real number. Let n 
be any integer that is strictly bigger than N and strictly bigger than |L| + 53 (say strictly 
bigger than N + |L| +53). Such an integer exists. Then by the reverse triangle inequality, 
In — L| > |n| —|L| 2 58 =e. 


The last two examples are different: the first one has no limit at all since the terms 
oscillate wildly, but for the second example we have a sense that its limit is infinity. We 
formalize this: 
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Definition 8.3.4. A real-valued sequence {s,,} diverges to oo if for every positive real 
number M there exists a positive number N such that for all integersn > N, 8, > M. We 
write this as lims,, = oo. 

A real-valued sequence {s,,} diverges to —oo if for every negative real number M 
there exists a positive real number N such that for all integers n > N, 8, <M. We write 


this as lim s,, = —oo. 
Example 8.3.5. limn = oo. 


Proof. Let M > 0. Set N = M. (As in epsilon-delta or epsilon-N proofs, we must figure 
out what to set N to. In this case, N = M works). Let n € N* with n > N. Since N = M, 
we conclude that n > M, and the proof is complete. 


Example 8.3.6. lim Wn! = oo. 
Proof. Let M > 0. Let Mo be an integer that is strictly greater than M. By Example 8.2.10 
there exists N, > 0 such that for all integers n > Nj, 


Mo! 
(Mo + 1)o 


Mor tee 
Mo 1 


n 


Since Mo! = Mo(Mo — 1)(Mo — 2):--3-2-1< (My +1)™, this means that 


<1 — 2{__Me Mo+1—M _ M 
= (Mp + 1)™o Mo +1 “= Mj +1’ 


so that %/ Omiya > ieee Set N = max{N,, Mo}. Then for all integers n > N, 


Mo! 
—_ —1)---(Mpn+1)- Mal > (My +1)"-™2 . MO! = (Ma + 1)" ——_—-___ 
nl = n(n —1)---(Mo + 1)- Mo! = (Mo + 1) o! = (Mo + 1) (Mp + 1) Mo’ 
so that 
i Mo)! M 
Vn! > (M 1)?)—————— M 1 =M 
mh 2 (Mo 1) Ai ie aye > MOP ag 


Theorem 8.3.7. (Comparison theorem (for sequences with infinite limits)) Let 
{sn}, {tn} be real-valued sequences such that for all sufficiently large n (say for n > N for 
some fixed N), 8, < tn. 

(1) If lim sp, = co, then limtn = oo. 

(2) If limt, = —oo, then lim s, = —oo. 


Proof. (1) By assumption lim s,, = co for every positive M there exists a positive N’ such 
that for all integers n > N’, s, > M. Hence by assumption t,, > s, for all n > N we get 
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that for every positive M and for all integers n > max{N, N’}, t, > M. Thus by definition 


lim t, = 00. 


Part (2) has an analogous proof. 


Example 8.3.8. lim a = oOo. 
Proof. Note that for all n € N*, al =nt+ 4 > n, and since we already know that 
n ntl 


lim n = oo, it follows by the comparison theorem ok that lim = Oi 


n?—1 
n 


Example 8.3.9. 


= ©. 


Proof. Note that for all integers n > 2, nm-l __yn—1l>2 We already know that 


n TM: 2D. 
limn = oo, and it is straightforward to prove that lim 5 = oo. Hence by the comparison 
n2-1 
n 


= 00. 
Or, we can give an M — N proof. Let M > 0. Set N = max{2,2M}. Let n be an 
integer strictly bigger than N. Then 


eae 
n ae Be ap 
n n 2 2 


Theorem 8.3.10. Let {s,} be a sequence of positive numbers. Then lim s, = oo if and 
only if lim + =. 


Proof. Suppose that lims, = oo. Let « > 0. By the definition of infinite limits, there 
exists a positive number N such that for all integers n > N, s, > 1/e. Then for the same 
n 0< = < €, so that |4| <e. This proves that lim = =); 

Now suppose that lim = = 0. Let M bea positive number. By assumption lim = = 
0 there exists a positive number N such that for all integers n > N, |x —0|<1/M. Since 
each s,, is positive, it follows that for the same n, = <1/M, so that s, > M. This proves 


that lim s, = oo. 


Exercises for Section 8.3 


8.3.1. Prove that the following sequences diverge: 


i) {vn}. 

ay oy 

iii) {(n + 1)? — n3}. 

iv) {(-1)” + 3/n}. 
8.3.2. Suppose that {s,,} diverges and {t,,} converges. Prove that {s, +t,} diverges. 
8.3.3. Suppose that {s,,} diverges and {t,,} converges to a non-zero number. Prove that 
{Sn +t} diverges. 
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8.3.4. Give an example of two divergent sequences {s,,}, {tn} such that {s,,+t, } converges. 


8.3.5. Let {s,} be a sequence of negative numbers. Prove that lim s,, = —oo if and only 


if lim + = 0. 


8.3.6. Given the following sequences, find and prove the limits, finite or infinite, if they 


exist. Otherwise, prove divergence: 


i) {345}. 
ii) {25-2}. 
iii) {PP}. 
iv) {545}. 
v) {25788}. 
vi) {5°}. 
vii) {77}. 

) 


Se n! 
vill {ati 
8.3.7. Prove or give a counterexample: 


i) If {s,} and {t,,} both diverge, then {s,, + t,,} diverges. 


) 
ii) If {s,,} converges and {t,,} diverges, then {s, +t, } diverges. 
iii) If {s,,} and {t,,} both diverge, then {s,, - t,,} diverges. 

) 


If {s,,} converges and {t,,} diverges, then {s,, - t,} diverges. 
8.3.8. Find examples of the following: 
i) A sequence {s,,} of non-zero terms such that lim{—2-} = 0. 


Sn41 
ii) A sequence {s,,} of non-zero terms such that lim{2+*+} = 1. 


) 
) 
iii) A sequence {s,,} of non-zero terms such that lim{=*} Teo 
iv) A sequence {s,,} such that lim{s,41 — s,} = 0. 
) 


v) A sequence {s,,} such that lim{s,41 — 5, } = 00. 


8.3.9. Suppose that the sequence {s,,}, diverges to oo (or to —oo). 


Prove that {s,}n 


diverges. (The point of this exercise is to parse the definitions correctly. ) 


8.4 Convergence theorems via epsilon-N proofs 


All of the theorems in this section are also proved in Section 8.5 with a different 


method; here we use the epsilon-N formulation for proofs without explicitly resorting to 


functions whose domains have a limit point. I recommend reading this section and omitting 


Section 8.5. 
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Theorem 8.4.1. If a sequence converges, then its limit is unique. 


Proof. Let {s,} be a convergent sequence. Suppose that {s,,} converges to both L and L’. 
Then for any € > 0, there exists an N such that |s, — L| < €/2 for alln > N. Likewise, for 
any € > 0, there exists an N’ such that |s,,— L’| < €/2 for alln > N’. Then by the triangle 
inequality, |L — L’| = |L — 8, + 8, — L'| < |L — sn|+|sn — L'| < €/2 + €/2 =e. Since € is 
arbitrary, by Theorem 2.10.4 it must be the case that |Z — L’| = 0, i.e., that LD = L’. 


Theorem 8.4.2. Suppose that lim{s,,} = L and that L £0. Then there exists a positive 
number N such that for all integers n > N, |s,| > |L|/2. In particular, there exists a 
positive number N such that for all integers n > N, sp, # 0. 


Proof. Note that p = |L£|/2 is a positive real number. Since lim{s,,} = L, it follows that 
there exists a real number N such that for all integers n > N, |s, — L| < |L|/2. Then by 
the reverse triangle inequality (proved in Theorem 2.10.3), 


[$n] = [Sn —L + L| = |(8n — L) + £| 2 |L| — |8n — L] > [L| — |L|/2 = |L|/2. 


Theorem 8.4.3. Suppose that lims, = L and limt, = K. Then 
(1) (Constant rule) For any complex number c, lim{c} = c. 
(2) (Linear rule) lim{1/n} = 0. 
(3) (Sum/difference rule) lim{s, +t,}= D+ K. 
(4) (Scalar rule) For any complex number c, lim{cs,} = cL. 
(5) (Product rule) lim{s,t,} = LK. 
(6) (Quotient rule) Ift, #0 for alln and K 40, then lim{s,,/t,} = L/K. 
(7) (Power rule) For all positive integers m, lim{s'?} = L™. 


Proof. Part (1) was proved immediately after Definition 8.2.1. Part (2) was Example 8.2.2. 

Part (3): Let « > 0. Since lim s, = L, there exists a positive real number N; such 
that for all integers n > Nj, |s, — L| < €/2. Since limt, = K, there exists a positive real 
number N2 such that for all integers n > No, |t, —K| < €/2. Let N = max{N,, No}. Then 
for all integers n > N, 


(Sn Ein) — (LK) = |(8n — L) + (in — )| 
< |s, — L| + |t, — K| (by the triangle inequality) 
<€/2+6€/2 (sincen > N > NM, No) 
= €. 
This proves (3). 


Part (4): Let € > 0. Note that €/(|c| + 1) is a positive number. Since lims, = L, 
there exists N such that for all integers n > N, |s, — L| < €/(|c| +1). Then for the same 
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n, |C&, — cL| = |c|- |sn — L| < |cle/(|c] +1) = ater <e. Since € is any positive number, 
we conclude that cs, converges to cL. 

Part (5): Let « > 0. Since lim{s,,} = L, there exists N; such that for all integers 
n> Ni, |s,—L| <1. Thus for all such n, |s,,| = |s, -L+L| < |s,—L|+|L| < 1+|L|. There 
also exists Nz such that for all integers n > No, |s, — L| < €/(2|K|+ 1). By assumption 
lim{t,,} = K there exists N3 such that for all integers n > Ns, |tn — K| < €/(2|Z|+2). Let 


N = max{ Nj, No, N3}. Then for all integers n > N, 


lSntpn — LK | = |8ntn — 5nK +8,K —LK| (by adding a clever 0) 
= |(Sntn — Sn) + (snk — LK)| 


< |Sntn — Sn K|+ |s,K —LK| (by the triangle inequality) 
= |8n(tn — K)| + |(Sn — L).K| 
= |8n| + [tn — K] +n — L] - |K| 
€ € 

D\|+1)- Ke 
SaPlry E42 AK) +1 es) 

Ect 

2 2 


which proves (5). 

Part (6): Let « > 0. Since lim{s,,} = L, there exists N; such that for all integers 
n > Ny, |S, — L| < 1. Thus for all such n, |s,| = |s, -£4+D| < |s, —L/+|£L| < 14+ |L£. 
There also exists N2 such that for all integers n > No, |S, — L| < lle By assumption 
lim{t,} = L there exists N3 such that for all integers n > Ns, |tn — K| < wy Since 
Kk # 0, by Theorem 8.4.2 there exists a positive number Ny, such that for all integers 


n> Na, |tn| > |K|/2. Let N = max{Nj, No, N3, Na}. Then for all integers n > N, 


Sn L om K-s,—-L-ty 
tn K\| Kat, 
K- Ho bn Sy tnSn —L-ty ; 
= ° oe (by adding a clever 0) 
K-ty 
K + 8p, — tn8n a ae Bs . 
< a rs ° e on (by the triangle inequality) 
1 1 1 
= |K —t,||sp| +|L=s,| 
JA] [tn | K 
|K|?e 19s. WKF 
oa ee 
4(1 + |L]) |K||K| 2 |K| 
Stef 
2 8 


=€, 


which proves (6). 
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Part (7): The proof is by induction on m, the base case being the assumption. If 
limnséo{1/n™—'} = 0, then by the product rule; lim,.{1/n™} = limaseo{l/n™ - 
1/n} = limp sof{1/n™ 4} + limy+00{1/n} = 0. This proves (7). 


5n-—2 


Example 8.4.4. Suppose s, = To prove that lims, = 5/3, we note that s, = 


3n+4° 
Ss ae = ee By the linear rule, lim(1/n) = 0, so by the scalar rule, lim(2/n) = 
lim(4/n) = 0. Thus by the constant, sum, and difference rules, lim{5 — 2/n} = 5 and 


lim{3 + 4/n} = 3, so that by the quotient rule, lim s,, = 5/3. 


2 2 
Example 8.4.5. Let s,, = 24+2. Note that s, = 242. cape STAAL . By the linear 
n2—3 n2—-3 1/n 1-3/n 


rule, lim1/n = 0, so that by the scalar rule, lim3/n = 0, and by the product and scalar 
rules, lim 2/n? = lim3/n? = 0. Thus by the sum and difference rules, lim{3/n+2/n?} = 0 
and lim{1 — 3/n?} = 1. Finally, by the quotient rule, lims, = 0/1 =0. 


Theorem 8.4.6. (Power, polynomial, rational rules for sequences) For any positive 
integer m, limn+oo{1/n™} = 0. If f is a polynomial function, then lim{ f(1/n)} = f(0). If 
f is a rational function that is defined at 0, then lim{ f(1/n)} = f(0). 


Proof. By the linear rule (in Theorem 8.4.3), limp+.{1/n} = 0, and by the power rule, 
for all positive integers m, limy,..{1/n™} = 0. 

Now write f(x) = ao +aiv +--+ + a,x" for some non-negative integer k and some 
complex numbers dao, a1,...,@%. By the constant, power and repeated sum rules, 


lim filin)= lim {ag + ay(1/n) + a2(1/n)? +--+ + ag(1/n)*} 
= 49 + lim ay(1/n) + lim ag(1/n)? +--+ + lim ag(1/n)* 
=aot+a1:0+a2-0? +--+ + az: 0" 
= a0 = f (0). 


This proves the polynomial rule. 

Finally, let f be a rational function. Write f(x) = g(x)/h(x), where g,h are polyno- 
mial functions with h(0) 4 0. By the just-proved polynomial rule, lim,_... g(1/n) = g(0), 
limn+oo h(1/n) = h(0) 4 0, so that by the quotient rule, lim,-. f(1/n) = g(0)/h(0) = 
7(0). 


Theorem 8.4.7. (The composite rule for sequences) Suppose that lim s,, = L. Let 
g be a function whose domain contains [ and all terms s,,.. Suppose that g is continuous 
at L. Then lim g(sn) = g(L). 


Proof. Let € > 0. Since g is continuous at L, there exists a positive number 6 > 0 such 
that for all x in the domain of g, if |a — L| < 6 then |g(x) — g(L)| < €. Since lims, = LD, 
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there exists a positive number N such that for all integers n > N, |s, — L| < 6. Hence for 


the same n, |g(Sn) — g(L)| < «. 


In particular, since the absolute value function, the real part, and the imaginary part 
functions are continuous everywhere, we immediately conclude the following: 


Theorem 8.4.8. Suppose that lims, = L. Then 
(1) lim |s,,| = |L]. 
(2) limRes, = ReL. 
(3) limIms,, = Im L. 


Furthermore, since the real and imaginary parts determine a complex number, we 


moreover get: 


Theorem 8.4.9. A sequence {s,,} of complex numbers converges if and only if the se- 
quences {Re s,,} and {Im s,,} of real numbers converge. 


Proof. By Theorem 8.4.8 it suffices to prove that if lim{Res,,} = a and lim{Ims,,} = }, 
then lim{s,,} = a+ bi. Let « > 0. By assumptions there exist positive real numbers Nj, N2 
such that for all integers n > N, | Res, —a| < €/2 and such that for all integers n > No 
|Ims, — b| < €/2. Set N = max{Nj, No}. Then for all integers n > N, 
|sn — (a+ bi)| = |Res, +iIms, — a— bil 
<|Res,, —al|i||Ims, — | 
<¢€/2+6€/2 


= €. 


Theorem 8.4.10. (Comparison of sequences) Let s and t be convergent sequences of 


complex numbers. Suppose that |s,| < |t,| for all except finitely many n. Then |lims,| < 
| irae a 

If in addition for all except finitely many n, s,,t, are real numbers with sy, < tn, 
then lim s,, < limt,,. 


Proof. Let L = lims,, K = limt,. By Theorem 8.4.8, lim |s,,| = |Z], lim |t,| = |]. 

Suppose that |L| > |K|. Set « = (|L| — |K|)/2|. By the definition of convergence, 
there exist Ni, N2 > 0 such that ifn is an integer, n > N, implies that —e < |s,,|—|L| < e, 
and n > N2 implies that —e < |t,,|—|K| < «. Let N3 be a positive number such that for all 
integers n > N3, |Sn| < |tn|. If we let N = mazr{ Nj, No, N3}, then for integers n > N we 
have that |t,| < |A| +6 = (|L| + |K])/2 = |L| —e€ < |s,|. This contradicts the assumption 
lsn| < |tn|, so that necessarily |L| < |K}. 

The proof of the second part is similar and left to the exercises. 
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Theorem 8.4.11. (The squeeze theorem for sequences) Suppose that s,t,u are se- 
quences of real numbers and that for all n € Nt, sn < tn < Un. If lims and limu both 
exist and are equal, then limt exists as well and 


liims = limt = lim w. 


Proof. Set L = lims = limu. Let € > 0. Since lims = L, there exists a positive N, such 
that for all integers n > Nj, |sn — L| < €. Since limu = L, there exists a positive N2 such 
that for all integers n > No, |un — L| < «. Set N — max{N,, No}. Let n be an integer 
strictly greater than N. Then —e < s, -L <t,-—L<un—L<e, so that |t, — L| <e. 
Since € was arbitrary, this proves that limt = L. 


Example 8.4.12. lim(n+1)/"=1 


Proof. For n > 2,1 < Cr) = ee < n'/". Thus by the previous theorem and 
by Example 8.2.9, lim{ ( ntl ye "hn = 1. Hence by Theorem 8.4.3 and by Example 8.2.9, 


1 
lim{(n + 1) ae = } = lim { ( oe a niin} = lim { (a2) 7" lim{n/"} = 1. 


Exercises for Sections 8.4 and 8.5 


8.4.1. Compute the following limits, and justify: 


ju 20° 


8.4.2. Give an example of a sequence {s,,} and a number FL such that lim|s,,| = |L| but 
{sn} does not converge. Justify your example. 


8.4.3. Prove that lim ‘oo i i 0. 

{8.4.4. Let {s,} be a convergent sequence of positive real numbers. Prove that lim s,, > 0. 
8.4.5. Prove that for all integers m, limn_+., (eeny"™ =1. 

8.4.6. Prove that lim, .. {3 — 2} =f 


8.4.7. Prove that lim {/n+1— yn} =0. 
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8.4.8. By Example 8.2.8, lim { (1 + 1)"\ =e. Determine the following limits: 


){ 

ii) {+ 4)"} 
iii) {(1+ 1)? 
W4(lhae) 
vy) {ata} 


8.4.9. Let s; be a positive real number. For each n > 1, let sn41 = \/Sn. Prove that 
lim s, = 1. (Hint: Prove that lim s,, = lim s?.) 


8.5 Convergence theorems via functions 


The results in this section are the same as those in Section 8.4, but here they are 
proved with theorems about limits of functions that were proved in Chapter 4. So, a 
connection is made between limits of functions and limits of sequences. The reader may 
omit this section (or the previous one). This section is more abstract; one has to keep 
in mind connections with functions as well as theorems about limits of functions to get 
at theorems about limits of sequences. Exercises for this section appear at the end of 
Section 8.4. 


For any sequence s we can define a function f : {1/n:n € N*} > C with f(1/n) = 
sy. Conversely, for every function f : {1/n:n € Nt} — C we can define a sequence s with 
$= lyn): 

The domain of f has exactly one limit point, namely 0. With this we have the usual 
notion of lim,-50 f(a) with standard theorems from Section 4.4, 


Theorem 8.5.1. Let s, f be as above. Then lims, = L if and only if limy, 49 f(a) = L. 


Proof. (=) Suppose that lim s, = L. We have to prove that lim,_,o f(z) = L. Let € > 0. 
By assumption lim s,, = L, there exists a positive real number N such that for all integers 
n> N, |s, —L| < «. Let 6 =1/N. Then 6 is a positive real number. Let x be in the 
domain of f such that 0 < |x —0| < 6. Necessarily x = 1/n for some positive integer 
n. Thus 0 < |x — 0] < 6 simply says that 1/n < 6 = 1/N, so that N <n. But then by 
assumption |f(1/n) — L| = |s, — L| < €, which proves that lim; _,9 f(x) = L. 

(<=) Now suppose that lim,.9 f(z) = L. We have to prove that lims, = L. Let 
€ > 0. By assumption lim,_,9 f(z) = L there exists a positive real number 6 such that for 
all x in the domain of f, if 0 < |a —0| < 6 then |f(x) — L| < «. Set N = 1/6. Then N 
is a positive real number. Let n be an integer greater than N. Then 0 < 1/n <1/N =0, 
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so that by assumption |f(1/n) — L| < e. Hence |s,, — L| = |f(1/n) — L| < €, which proves 
that lims, = L. 


Example 8.5.2. (Compare the reasoning in this example with the epsilon-N proofs of 


Section 8.2.) Let sy, = aa We note that s, = ae : oe = at. The corresponding 


function f : {1/n : n € N*} is f(x) = 2-82, and by the scalar, sum, difference, and 


3+42? 
quotient rules for limits of functions, lim,_.9 f(x) = 30 = 5/3, so that by Theorem 8.5.1, 
lim s, = 5/3. 
2 2 
Example 8.5.3. Suppose s, = *3+2. Note that s, = 3 . oe = “- . The 
2 

corresponding function f : {1/n : n € N*} is f(x) = fee , and by the scalar, sum, 
difference, product, and quotient rules for limits of functions, lims, = lim; 49 f(#) = 
0+0 _ g 

1-0 : 


Theorem 8.5.4. The limit of a converging sequence is unique. 


Proof. Let {s,} be a convergent sequence. Suppose that {s,} converges to both L 
and L’. Let f : {1/n:n € N*} be the function corresponding to s. By Theorem 8.5.1, 
lim,—+o0 f(x) = Land lim,_49 f(z) = L’. By Theorem 4.4.1, L = L’. This proves uniqueness 


of limits for sequences. 


Theorem 8.5.5. Suppose that lim{s,,} = LZ and that L £0. Then there exists a positive 
number N such that for all integers n > N, |s,| > |L|/2. In particular, there exists a 
positive number N such that for all integers n > N, sp, 4 0. 


Proof. Let f be the function corresponding to s. Then lim,-,9 f(x) = L, by Theorem 4.4.2 
there exists 6 > 0 such that for all x in the domain of f, if x < 6 then |f(x)| > |L|/2. 
Set N = 1/6. Let n be an integer strictly greater than N. Then 1/n < 1/N = 6, so 
[sn] = [f(1/n)| > |£1/2. 


Theorem 8.5.6. Suppose that lims, = L and limt, = kK. Then 
(1) (Constant rule) For any complex number c, lim{c} = c. 
(2) (Linear rule) lim{1/n} = 0. 
(3) (Sum/difference rule) lim{s, +t,}= D+ K. 
(4) (Scalar rule) For any complex number c, lim{cs,,} = cL. 
(5) (Product rule) lim{syt,} = LK. 
(6) (Quotient rule) If t, 4 0 for alln and K 40, then lim{s,,/t,} = L/K. 


(7) (Power rule) For all positive integers m, lim{s'?} = L™. 
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Proof. Let f,g : {1/n: n € Nt} > C be given by f(1/n) = sp, g(1/n) = ty. By 
Theorem 8.5.1, limz-40 f(z) = LD and limz-,0 g(x) = K. Theorem 4.4.3 proves parts (3), 
(4), (5), (6) for f,g, hence via Theorem 8.5.1 also for s,t. Theorem 4.4.4 finishes the proof 
of (7). 

For part (1), declare f(1/n) = c and for part (2), declare f(1/n) = 1/n. Again 
Theorems 4.4.3 and 8.5.1 easily finish the proofs of (1), (2). 


The following theorem for sequences follows immediately from the corresponding 


power, polynomial, and rational rules for functions: 


Theorem 8.5.7. (Power, polynomial, rational rules for sequences) Let f be a 
polynomial function. Then lim{f(1/n)} = f(0). In particular, for any positive integer m, 
limnp+oo{1/n™} = 0. If f is a rational function that is defined at 0, then lim{f(1/n)} = 
f(0). 


Theorem 8.5.8. (The composite rule for sequences) Suppose that lim s, = L. Let 
g be a function whose domain contains L and all terms s,,. Suppose that g is continuous 
at L. Then lim g(s,) = g(L). 


Proof. Let f(1/n) = s,. By Theorem 8.5.1, lim,.9 f(x) = L, and by assump- 
tion lim,.7 g(x) = g(L). Thus by the composite function theorem (Theorem 4.4.9), 
lim, +0 9(f()) = g(Z), so that by Theorem 8.5.1, lim g(s,) = g(L). 


In particular, since the absolute value function, the real part, and the imaginary part 
functions are continuous everywhere, we immediately conclude the following: 


Theorem 8.5.9. Suppose that lims, = L. Then 
(1) in| $y) "|: 
(2) limRes, = ReL. 
(3) limIms,, = Im L. 


Since the real and imaginary parts determine a complex number, we also have: 


Theorem 8.5.10. A sequence {s,,} of complex numbers converges if and only if the se- 
quences {Re s,,} and {Ims,} of real numbers converge. 


Proof. Let f(1/n) = sn. By Theorem 4.4.8, limz49 f(x) = JL if and only if 
lim, 9 Re f(z) = ReZ and lim, .9Im f(z) = ImZL, which is simply a restatement of 


the theorem. 
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Theorem 8.5.11. (Comparison of sequences) Let s and t be convergent sequences of 
complex numbers. Suppose that |s,,| < |t,| for all except finitely many n. Then |lims,| < 
| lity t|. 

If in addition for all except finitely many n, s,,t, are real numbers with s,, < tn, 


then lim s, < limt,. 


Proof. Let A be the set of those 1/n for which |s,,| < |t,| in the first case and for which 
Sn < tn in the second case. Let f,g: A > R be the functions f(1/n) = |s,|, g(1/n) = |tn| 
in the first case, and f(1/n) = sn, g(1/n) = t, in the second case. By assumption 
for all x in the domain, f(x) < g(x). Since 0 is a limit point of the domain (despite 
omitting finitely many 1/n) and since by Theorem 8.5.1, limz-,o f(x) and lim,;-49 g(z) 
both exist, by Theorem 4.4.10, limo f(x) < limz+o g(x). In the first case, this translates 
to | lim s,,| < |limt,|, and in the second case it translates to lims, < limt,. 


Theorem 8.5.12. (The squeeze theorem for sequences) Suppose that s,t,u are se- 
quences of real numbers and that for all n € Nt, sn < tn < Un. If lims and limu both 


exist and are equal, then limt exists as well and 


liims = limt = lim w. 


Proof. Let f,g,h: {1/n:n €N*} => C be functions defined by f(1/n) = sn, g(1/n) = tn, 
h(1/n) = un. The assumption is that for all x in the domain of f,g,h, f(x) < g(x) < h(a), 
and by Theorem 8.5.1 that lim, 9 f(z) = lim, 49h(xz). Then by the squeeze theorem 
for functions (Theorem 4.4.11), limz40 f(z) = limz4og(x) = limz+4o h(x). Hence by 
Theorem 8.5.1, lims = limt = limuw. 


Example 8.5.13. lim(n + 1)!/" = 1. 


Proof. For n > 2,1 < (aye = ( hes < n'/"_ Thus by the previous theorem and 
by Example 8.2.9, lim {(att)'”” = 1. Hence by Theorem 8.5.6 and by Example 8.2.9, 


3 
2 
1 
lim{(n + rea = lim { (244) 17” niin} = lim {cash lim{n’/"} = 1. 


Exercises for this section appear at the end of Section 8.4. 


8.6 Bounded sequences, monotone sequences, ratio test 


Section 8.6: Bounded sequences, monotone sequences, ratio test 303 


Definition 8.6.1. A sequence {s,,} is bounded if there exists a positive real number B 
such that for all integers n, |s,| < B. 

If all s, are real numbers, we say that {s,,} is bounded above (resp. below) if 
there exists a real number M such that for all positive integers n, 8, < M (resp. s, > M). 
In other words, {s,} is bounded if the set {s, :n € N*} is a subset of B(0,M) for some 
real number M. 


Sequences {4 + 5i}, {1/n}, {( ati) }, {(-1)"}, {3 — 4i"}, {-425}, {7} are 
bounded (the latter by Example 1.6.7), but {n}, {(—1)"n}, (meet. {Y/nl}, {2} are not 


n 


bounded. The next theorem provides many examples of bounded sequences. 
Theorem 8.6.2. Every convergent sequence is bounded. 


Proof. Let {s,} be a convergent sequence with limit L. Thus there exists a positive integer 
N such that for all integers n > N, |s,—L| < 1. Set B = max{|s1|, |52|, |53|,...,|sa4i|, |L/+ 
1}. Then for all positive integers n < N, |s,| < B by definition of B, and for n > N, 
S| See HR ban se Ele Lie 1) oe: 


Another proof of the theorem above is given in the next section via Cauchy sequences. 


Definition 8.6.3. A sequence {s,,} of real numbers is called non-decreasing (resp. non- 
increasing, strictly increasing, strictly decreasing) if for all n, 8, < Sn41 (resp. 
Sn > $8n41; 8n < 8n41; 8n > Sn41). Any such sequence is called monotone. 


n2+1 
n 


Sequences {1/n},{—n} are strictly decreasing, { } is strictly increasing, 


2 . . . . 
nt} > is neither increasing nor 


{(-1)"n} is neither increasing nor decreasing, { 
n= +5 


n 


decreasing, but { }n>2 is strictly increasing. 


Theorem 8.6.4. (Bounded monotone sequences) Let {s,,} be a bounded sequence 
of real numbers such that for some integer N, {8,}n>n is non-decreasing (resp. non- 
increasing). Then lims, exists, and equals the least upper bound (resp. greatest lower 
bound) of the set {sn,8N+1,SN+42,---}. 


Proof. Suppose that for all n > N, sy, < Sni1. By the Least upper bound theorem 
(Theorem 3.8.5), the least upper bound of the set {sy,sn41,Sn+2,.--} exists. Call it L. 

Let € > 0. Since L is the least upper bound, there exists a positive integer N’ > N 
such that 0 < ZL — sy < . Hence for all integers n > N’, sy < Sy, so that 


0<L-—s,<L—sy <e, 


which proves that for all n > N’, |s, — L| < «. Thus lims, = L. 
The proof of the case of s, > 8,41 for all n > N is similar. 


The theorem below was already proved in Example 8.2.7. 
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Theorem 8.6.5. (Ratio test for sequences) Let r € C with |r| < 1. Then limr” = 0. 


Proof. If r = 0, the sequence is the constant zero sequence, so of course its limit is 0. Now 
suppose that r 4 0. By Exercise 2.8.2, for all positive integers n, 0 < |r|"t1 < |r|". Thus 
the sequence {|r|"} is a non-increasing sequence that is bounded below by 0 and above 
by 1. By Theorem 8.6.4, L = lim |r|" = inf{|r|” : n € N*}. Since 0 is a lower bound and 
L is the greatest of lower bounds of {|r|" :n € N*}, necessarily 0 < L. 

Suppose that L > 0. Then L(1 — |r|)/(2|r|) is a positive number. Since L is the 
infimum of the set {|r|, |r|’, |r|?,...}, there exists a positive integer N > Nj such that 
O<|r\N-L< py el By multiplying by |r| we get that 


Ir] 


= Lil 
rN <b cael ob (in go) rt) 


D D 
_ led 
2 
ea 
aj ee) . yy, 


and since L is the infimum of all powers of |r|, we get that L < |r|‘++ < L, which is a 
contradiction. So necessarily L = 0. Hence for every € > 0 there exists N € R* such that 
for all integers n > N, ||r”| — 0| < €. But this says that |r” — 0| < €, so that limr” = 0 as 
well. 


Theorem 8.6.6. (Ratio test for sequences) Let {s,,} be a sequence of non-zero complex 


numbers and let L be a real number in the interval [0,1). Assume that lim |=2++| = L or 


Sn41 
Sn 


<L. 


that there exists a positive integer K such that for any integer n > Kk, 
Then lim s,, = 0. 


Proof. Let r be a real number strictly between L and 1. Then r and r — FL are positive 


numbers. 
Under the first (limit) condition, there exists a positive number K such that for all 
integers n > K, ||"2++| — L| <r-—L. Then foralln> K, 
Srl |_| Sal b+is|[2|—2]+p<r-b4b=9 
Sn Sn Sn 


Thus both conditions say that there exists a positive K such that for all integers 
hs jee | <r. We may replace this K by any larger number, and the conclusion still 


holds. So from now on we assume that K is a positive integer. 
Let « > 0. By Theorem 8.6.5, limr” = 0. Thus there exists I > 0 such that for all 
integers n > M, |r” —0| < 7 7 


i 
SK | 


. Set N = max{M, Kk}. Let n be an integer strictly greater 
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than N. Then 
Sn Sn-1 Sn—2 Sk-+1 
|S, — O| = 7 : Giese os “SK 
Sn—1 Sn-2 Sn-3 Sk 
Sn Sn—-1 Sn—2 Sk+1 
— é uidlehs F |sK| 
Sn-1 Sn—2 Sn—3 SK 
< rr lg i| 
=r"\sK\r—* 
<e€. 


Exercises for Section 8.6 


8.6.1. Compute and justify the limits. (Use the comparison test, ratio test, decreasing 
property, or other tricks.) 
i) limp—oo sr: 
Re 
) aor 
(=a) 
) ae 
iv) limp—oo “> for all k € C. 
ii 
) 


vi) limp-sc6 a for all non-zero k € C with |k| > 1 and all integers m. 


8.6.2. Give an example of a sequence {s,,} of non-zero complex numbers such that 
lim{s,41/S,} exists and has absolute value strictly smaller than 1. 


8.6.3. Give an example of a sequence {s,,} of non-zero complex numbers such that 
lim{$n41/8n} = 7/2. 

8.6.4. Let s,, be the nth Fibonacci number. Prove that lim ae — 1473 (Hint: Exer- 
cise 1.6.30.) 


8.6.5. Let r € C satisfy |r| > 1. Prove that the sequence {r”} is not bounded. 


8.6.6. (Ratio test again, and compare to Theorem 8.6.6) Let {s,,} be a sequence of non-zero 
complex numbers such that lim “++ = L. Suppose that |L| < 1. Prove that lim s,, = 0. 


8.6.7. Consider sequences {(—1)"} and {1}. For each, determine whether it converges, 
and the limit of ratios eek Comment why the Ratio Test does not apply. 


8.6.8. Let {s,,} be a sequence of real numbers such that for all n > 1, s, < Sn41 (resp. 
Sn > 8n41). Prove that {s,,} is convergent if and only if it is bounded. 


8.6.9. Prove that the sequence { (1+ 1)"),, is strictly increasing. (Hint: Exercise 7.6.11.) 
8.6.10. Prove that the sequence { Vn! } is strictly increasing. 
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8.6.11. Let {s,} be a non-decreasing sequence. Prove that the set {s1,52,53,...} is 
bounded below. 
8.6.12. (Monotone sequences) Let {s,,} be a monotone sequence of real numbers. 
i) Suppose that {s,,} is not bounded above. Prove that {s,,} is non-decreasing and 
that lim s,, = oo. 
ii) Suppose that {s,,} is not bounded below. Prove that {s,,} is non-increasing and 
that lim s, = —oo. 


iii) Prove that lim s,, is a real number if and only if {s,,} is bounded. 


8.7 Cauchy sequences, completeness of R, C 


So far, to determine convergence of a sequence required knowing the limit. In this 
section we prove an alternate machinery for deciding that a sequence converges without 
knowing what its limit is. This is used for subsequential limits and in comparison theorems 
for convergence of series in the next chapter. 


Definition 8.7.1. A sequence {s,,} is Cauchy if for all € > 0 there exists a positive real 
number N such that for all integers m,n > N, |S — Sm| < €. 


Theorem 8.7.2. Every Cauchy sequence is bounded. 


Proof. Let {s,} be a Cauchy sequence. Thus for € = 1 there exists a positive integer N 
such that for all integers m,n > N, |S, —Sm| <1. Then the set {|s1|, |s2|,...,|sav|, |Sa-4i]} 


is a finite and hence a bounded subset of R. Let M’ an upper bound of this set, and let 
M = M' +41. It follows that for all n = 1,...,N, |sn| < M, and for n > N, |s,| = 
Sn — Sn4i + $n41| < |8n — $n4il +lsw4il <1+M'=M. Thus {s,,} is bounded by M. 


Theorem 8.7.3. Every convergent sequence is Cauchy. 


Proof. Let {s,,} be a convergent sequence. Let L be the limit. Let « > 0. Since lim s, = L, 
there exists a positive real number N such that for alln > N, |s, — L| < €/2. Thus for all 
integers m,n > N, 


Sn — Sm| = |8n -L+L—- s8m| < |s, — DL) + |L— 5m| < €/2 + 6€/2 =e. 
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Remark 8.7.4. It follows that every convergent sequence is bounded (this was already 
proved in Theorem 8.6.2). 

The converse of Theorem 8.7.3 is not true if the field in which we are working is Q. 
For example, let s,, be the decimal approximation of \/2 to n digits after the decimal point. 
Then {s,,} is a Cauchy sequence of rational numbers: for every € > 0, let N be a positive 
integer such that 1/ 10% < e. Then for all integers n,m > N, s, and s,,, differ at most 
in digits N + 1,N +2,... beyond the decimal point, so that |s, — sm| < 1/10" <e. But 
{s,} does not have a limit in Q, so that {s,,} is a Cauchy but not a convergent sequence. 


im Ol =? 


Another way of writing this is: 


10” 


But over R and C, all Cauchy sequences are convergent, as we prove next. 


Theorem 8.7.5. (Completeness of R, C) Every Cauchy sequence in R or C is conver- 
gent. 


Proof. First let {s,,} be a Cauchy sequence in R (as opposed to in C). By Theorem 8.7.2, 
{s,} is bounded. It follows that all subsets {s1, 52, 83,...} are bounded too. In particular, 
by the Least upper bound theorem (Theorem 3.8.5), un = sup{Sn, $n41, Sn42,---} is a real 
number. For all n, un > Un+1 because uy, is the supremum of a larger set. Any lower 
bound on {81, $2, 53,...} is also a lower bound on {u1, u2, u3,...}. Thus by Theorem 8.6.4, 
the monotone sequence {u,,} has a limit LD = inf{uy, ue, u3,...}. 

We claim that L = lim{s,}. Let « > 0. Since {s,,} is Cauchy, there exists Ni > 0 
such that for all integers m > n > Nj, |8n — S5m| < €/2. Thus if we fix n > Nj, then for 
all m > n, we have that s,, < s, + €/2. But uy, is the least upper bound on all s,, for 
m >n, so that sy, < Un < 8, + €/2, and in particular, s, < Un < 8, +6€/2. It follows 
that |s, — up| < €/2 for all integers n > N,. Since L = inf{uj, ue, us,...}, there exists an 
integer No such that 0 < un, —L < €/2. Set N = max{N,, No}. Let n > N be an integer. 
By the definition of the u,, D < un < un < un,, so that O <u, —L < uy, —L < €/2. 
Hence 


|Sn — L] = |S8p — Un + Un — LD] < |S — Un| + lun — LD] < €/2+6€/2 =€. 


This proves that every real Cauchy sequence converges. 

Now let {s,,} be a Cauchy sequence in C. We leave it to Exercise 8.7.1 to prove 
that then {Res,,} and {Ims,,} are Cauchy. By what we have proved for real sequences, 
there exist a,b € R such that lim Res, = a and limIms, = b. Then by Theorem 8.5.10 or 
Theorem 8.4.9, {s,,} converges to a + bi. 
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Example 8.7.6. (Harmonic sequence) The sequence {1+ 5+ 4+-::++}n is not Cauchy, 
does not converge, is monotone, and is not bounded. 


Proof. Let s, be the nth term of the sequence. If m > n, then 


1 1 1 1 m-—n n 
Sm — $n = + + a ee = —]| : 
m m-l m-2 n+1 m m 


Thus if m = 1000n, then s,, — s, > 1 — 1/1000 = 0.999. In particular, if « = 1/2, then 
|S — Sn| & €. Thus the sequence is not Cauchy. By Theorem 8.7.3 it is not convergent, 
and by Theorem 8.7.2 it is not bounded. It is certainly strictly increasing. 

So necessarily the terms of {1+ 4 + 3 feet 1} get larger and larger beyond bound. 
But this sequence gets large notoriously slowly — the 1000th term of the sequence is smaller 
than 8, the 10000th term is smaller than 10, the 100000th term is barely larger than 12. 
(One could lose patience in trying to see how this sequence grows without bound.) 

Here is a thought experiment related to this: we can stack finitely many books to 
form a bridge to the Moon and beyond! All books weigh one unit of weight and are one 
unit along the longest side. The books are stacked so that each one protrudes out from 
the heap below as much as possible and with keeping the center of mass stable. The 
topmost book, by the uniform assumption, protrudes out s of its length. Actually, for the 
book to be stable this should actually be 4 minus a tad, but that tad can be taken to be 


2 

say ——_——.,, say, or much much less. We will ignore this tiny tad, but an interested 
googolplex 

reader may wish to work through the stacking below with (varied and tinier and tinier!) 

corrections being incorporated in all the centers of mass and all the protrusions. 


This diagram shows the top two books. 


———— 


The center of mass of the top two books is clearly at 3 from the right-hand edge of the 
bottom book, so that the third book from the top down should protrude from underneath 


1 
2141-4 


the second one 3. 
The center of mass of this system, measured from the rightmost edge, is —j—2 = 


DIO 


so that the fourth book has to protrude out z- 


3-141-4 


The center of mass of this system, measured from the rightmost edge, is —{—2 = Z 
so that the fifth book has to protrude out < units. 
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n—1)-141-4 = 
t aS) 2 — 2n—1 measured 
n 2n 


In general, the center of mass of the top n books is a 
from the rightmost edge, so that the (n + 1)st book should protrude out by x units. 
Thus the total protrusion of the top n books equals 


2 ee ee 
D 2° 3 n—1 


units. We just proved that this sum grows beyond limit. Thus we can reach the Moon 
with enough books (and a platform to stand on). 


Exercises for Section 8.7 


+8.7.1. (Invoked in Theorems 8.8.4 and 8.7.5.) Prove that {s,,} is Cauchy if and only if 
{Res,} and {Im s,,} are Cauchy. 
8.7.2. Suppose that {s,} and {t,,} are Cauchy sequences. 
i) Prove that {s,, +t, } is a Cauchy sequence. 
ii) Prove that {s,,-t,} is a Cauchy sequence. 
iii) Suppose that {s,,} is Cauchy, and that for some positive number B, |t,,| > B for 
alln € Nt. Prove that {s,/t,} is Cauchy. 
iv) Prove that for all c € C, {cs,,} is a Cauchy sequence. 
8.7.3. Give examples of non-Cauchy sequences {s,,}, {t,} such that {s, +t, } is a Cauchy 
sequence. Why does this not contradict the previous exercise? Repeat for {s, - ty}. 
8.7.4. Let r € C satisfy |r| > 1. Prove that limr” diverges. (Hint: Check the Cauchy 
property.) 
8.7.5. Review the proof in Example 8.3.2 that {(—1)"} is divergent. Give another proof 
using the contrapositive of Theorem 8.7.3. 
8.7.6. Give examples of sequences with the listed properties. 
i) A Cauchy sequence in Q that is not convergent. 
ii) A bounded sequence that is not convergent. 


) 

) 
iii) A bounded sequence that is not Cauchy. 
iv) A real increasing sequence that is not Cauchy. 
) 


v) A real Cauchy sequence that is increasing. 


8.7.7. Say why the sequences below do not exist. 
i) A convergent sequence that is not Cauchy. 


ii) A Cauchy sequence in R that is not convergent. 


iii) A bounded real increasing sequence that is not Cauchy. 
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Definition 8.8.1. A subsequence of an infinite sequence {s,,} is an infinite sequence 
{Sk1, Sko1Sk3,---} Where 1 < ky < ke < kg < --- are integers. Notations for such a 
subsequence are: 


{Sknt> {Skn}n> {Skntnoir {S8kntn>1, {Skn }nent- 
Note that for all n, ky, > n. 


Examples 8.8.2. Every sequence is a subsequence of itself. Sequences {1/2n}, {1/3n}, 
{1/(2n + 1)} are subsequences of {1/n}, and {1}, {-1}, {(—1)"**} are subsequences of 
{(-1)"}. The constant sequence {1} is not a subsequence of {1/n}, because the lat- 
ter sequence does not have infinitely many terms equal to 1. If {s,} is the sequential 
enumeration of QT on page 280, then {1/n}, {n}, eran {+;} are subsequences, but 


{aaah = {1/3,1/2,...} is not. 


Theorem 8.8.3. A subsequence of a convergent sequence is convergent, with the same 
limit. A subsequence of a Cauchy sequence is Cauchy. 


Proof. Let {s,} be a convergent sequence, with limit L, and let {s,,,} be a subsequence. 
Let «€ > 0. By assumption there exists a positive number N such that for all integers 
n> N, |sn — L| < «. Since n < ky, it follows that |s,, — L| < ¢«. Thus {s,,,} converges. 
The proof of the second part is similar. 


Theorem 8.8.4. Every bounded sequence has a Cauchy subsequence. 


Proof. This proof uses the halving construction already encountered in Construction 3.14.1, 
and here the property P is that the subset contains infinitely many elements of the sequence. 

Let {s,} be a bounded sequence (of real or complex numbers). Let M be a positive 
real number such that for all n, |s,| << M. Let ag = co = —M and bo = do = M. The 
sequence {s,} has infinitely many (all) terms in the rectangle Ro = [ao, bo] x [co, do]. Set 
lo = 0. (If all s, are real, we may take co = dp = 0, or perhaps better, ignore the second 
coordinates. ) 

We prove below that for all m € N* there exists a subsequence {s;,, } all of whose 
terms are in the rectangle Ry, = [@m,bm] X [Cm,dm], where bm — @m = 2~™(bo — ao), 
[ais Ori So Gnas Oma. GaGa = 2 de — Co), en daa SC [Gnas dex |. Furthermore; 
we prove that there exists 1,, > lm —1 such that s;,, © Rm. 

Namely, given the (m — 1)st rectangle R,,-1, integer l,,-1 such that s;,,_, € Rm-1, 
and a subsequence {s,,, } all of whose terms are in R,,—1, divide R,,—1 into four equal-sized 
subrectangles. Necessarily at least one of these four subrectangles contains infinitely many 
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elements of {s,,,}, so pick one such subrectangle, and call it R,,. Therefore there exists 


a subsequence of {s,,,} that is contained in R,,, and that subsequence of {s,,,} is also a 


/ 
n? 


subsequence of {sp}. We call it {sz}. Since we have infinitely many k,,, in particular 
there exists k/, > l,-1, and we set l,, = k/,. Thus s;,, € Rm. 

By construction, {s7, }, is a subsequence of {s,},. We next prove that {s;, }n is 
a Cauchy sequence. Let € > 0. Since the either side length of the mth subrectangle R,, 
equals the corresponding side length of Ro divided by 2’, by Exercise 3.8.3 there exists 
a positive integer N such that any side length of Ry is strictly smaller than the constant 


€/2. Let m,n > N be integers. Then s;,,, 5; are in Ry, so that 


n 


lsi,, — 81,| < \/(one side length of R,,)? + (other side length of R,)? < e. 


The following is now an immediate consequence of Theorem 8.7.5: 


Theorem 8.8.5. Every bounded sequence in C has a convergent subsequence. 


Example 8.8.6. We work out the construction of a subsequence as in the proof on the 


bounded sequence {(—1)” — 1}. For example, all terms lie on the interval [ao, bo] = [—4, 4]. 
Infinitely many terms lie on [a,, b1] = [—4,0], and on this sub-interval I arbitrarily choose 
the second term, which equals 0. Infinitely many terms lie on [a2,b2] = [—4,—2], in 


particular, I choose the third term —2. After this all terms of the sequence in [a2, bz] are 
—2, so that we have built the Cauchy subsequence {0,—2,—2,—2,...} (and subsequent 
[an, bn] all have b, = —2). We could have built the Cauchy subsequence {—2,—2,...}, 
or, if we started with the interval [—8,8], we could have built the Cauchy subsequences 
{0,0,-—2, -2,...} or {—2,0,0, —2, —2,...}, and so on. 


Definition 8.8.7. A subsequential limit of a sequence {s,,} is a limit of any subsequence 
of {s,}. Thus a subsequential limit can be a complex number, or as in Definition 8.3.4, it 
can be +00. 


Here are a few examples in tabular form: 


sequence {s,,} the set of subsequential limits of {s,,} 


convergent {5,, } {lim s»,} 
Ge fh} 
(ea, {0, —2} 
{(-1)" + 1/n} Lal} 
{(-1)"n+ n+ 3} {3, co} 

{i"} {i, —1, —i, 1} 
{n} {oo} 
{(-1)"n} {—00, 00} 


312 Chapter 8: Sequences 


Theorem 8.8.8. Every unbounded sequence of real numbers has a subsequence that has 


limit —oo or oo. 


Proof. If {s,,} is not bounded, choose k; € N* such that |s,,| > 1, and once ky_1 has been 
chosen, choose an integer k, > kyn—1 such that |s,,| > n. Now {sx,,}n is a subsequence 
of {s,,}. Either infinitely many among the s,,, are positive or else infinitely many among 
the sz, are negative. Choose a subsequence {5;,}, of {5,,,}, such that all terms in {s,, } 
have the same sign. If they are all positive, then since s;, > n for all n, it follows that 
lim,-+o0 $1, = 00, and if they are all negative, then since s;, < —n for all n, it follows that 


linn 4s5-8i, = Co, 


Exercises for Section 8.8 


8.8.1. Determine the subsequential limit sets of the following sequences, and justify: 
i) {(1/2 + iV3/2)"}, {(1/2 — iV3/2)”}. 

i) {1/2”}. 

i) {(1/2 + i/3/2)" + 1/2"}. 
{1/2 144/32)? 1/2" %. 
{ Nie 


v) {(1/2 + iV3/2)” + (1/2 — iV3/2)"}. 
vi) {(1/2 + iV3/2)” - (1/2 — iV3/2)"}. 


8.8.2. Consider the sequence enumerating QT as on page 280. Prove that the set of its 


subsequential limits equals Ryo. 


8.8.3. Prove that the real-valued sequence {seh cy } is unbounded. Find a sub- 


sequence that diverges to oo and a subsequence that converges to —1. 
8.8.4. Let m € N* and let ¢ be the complex number of absolute value 1 at angle 27/m 
counterclockwise from the positive real axis. Find the set of subsequential limits of {c”}n. 


8.8.5. Give examples of sequences with the listed properties, if they exist. If they do not 
exist, justify. 
i) A sequence with no convergent subsequences. 
i) A sequence whose set of subsequential limits equals {1}. 
iii) A sequence whose set of subsequential limits equals {1,3}. 
iv) A sequence whose set of subsequential limits equals {1, 2, 12}. 


8.8.6. Suppose that a Cauchy sequence has a convergent subsequence. Prove that the 


original sequence is convergent as well. 


8.8.7. Prove that a sequence of real numbers contains a monotone subsequence. 
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8.9 Liminf, limsup for real-valued sequences 


Recall that by Theorem 3.8.5 every non-empty subset T of R bounded above has a 


least upper bound sup 7’ = lubT in R, and that every non-empty subset 7’ bounded below 


has a greatest lower bound inf T = inf T in R. We extend this definition by declaring 


sup TI’ = co if T is not bounded above, 


inf T = —oo if T is not bounded below. 


The same definitions apply to sequences thought of as sets: sup{1/n},, = 1, inf{1/n},, = 0, 
sup{n}, = oo, inf{n}, = 1, sup{(—1)"}, = 1, inf{(-1)”},» = —1, et cetera. 

Much analysis of sequences has to do with their long-term behavior rather than with 
their first three, first hundred, or first million terms — think convergence or the Cauchy 
property of sequences. Further usage of such tail-end analysis is in the next chapter (for 
convergence criteria for series). Partly with this goal in mind, we apply infima and suprema 
to sequences of tail ends of sequences: 


Definition 8.9.1. Let {s,,} be a real-valued sequence. The limit superior limsup and 


limit inferior liminf of {s,,} are: 


lim sup s, = inf{sup{s, :n >m}:m > 1}, 


lim inf s, = sup{inf{s, :n >m}:m> 1}. 


For any positive integers m1 < ma, 


inf{s, :n > m2} < sup{s, :n > mg} < sup{s, :n > my}, 
inf{s, :n > m1} < inf{s, :n > ma} < sup{s, : 2 > mo}. 


Hence for any positive integers m1,mg2, inf{s, :n > m2} < sup{s, :n > my}, so that 
lim inf s,, = sup{inf{s, :n >m}:m> 1} <inf{sup{s, :n >m}:m>1}=limsups,. 

If {s,} is bounded by A below and B above, then also A < liminf s,, < limsups, < 
B, so that in particular by the Least upper bound theorem (Theorem 3.8.5), liminf s,, and 
lim sup s,, are real numbers. 

In other words, limsups, is the infimum of the set of all the suprema of all the 
tail-end subsequences of {s,,}, and analogously, liminfs,, is the supremum of the set of 
all the infima of all the tail-end subsequences of {s,,}. In the plot below, the sequence 
{sy}, drawn with thick dots, oscillates between positive and negative values with peaks 
and valleys getting smaller and smaller. The connected non-increasing dashed top line 
denotes the sequence {sup{s,, :m >n}}, and the connected bottom non-decreasing line 
represents the sequence {inf{s,,:m >n}}n. 
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For liminf s, to be a real number it is not enough for the sequence to be bounded 
below: for example, {n} is bounded below but liminf{n} = sup{inf{n :n > m}:m> 
1} =sup{m:m > 1} =o. Similarly, for limsup s,, to be a real number it is not enough 
for the sequence to be bounded above. 


Theorem 8.9.2. If {s,} converges to L, then liminf s, = limsup s, = L. 


Proof. Let ¢ > 0. Then there exists N > 0 such that for all integers n > N, |s, — L| < e. 
Thus for all m > N, sup{s, :n >m}> L+4+., so that D—e < limsups, < L+e. Since 
this is true for all € > 0, it follows by Theorem 2.10.4 that L = limsups,. The other part 
is left to the reader. 


Remark 8.9.3. (Ratio test for sequences) With the new language, Theorem 8.6.6 
can be rephrased as follows: If {s,,} is a sequence of non-zero complex numbers such that 
lim sup{|5n41/5n|} <1, then lim s, = 0. The proof there already accomplishes this. On the 
other hand, the ratio test for divergence in Exercise 8.6.6 is not phrased in the most general 
form. One generalization is that if lim sup{|sn41/sn|} > 1, then {s,} diverges. The proof 
is simple. Let r € (1, liminf{|s,41/s,|}). By definition of liminf as supremum of some 
infima, this means that there exists an integer m such that inf{|sy41/s,|:2 > m}>r. 
Thus by an easy induction, for all n > m, |s,| > r"~™|sm|, and then by the Comparison 


test (Theorem 8.3.7), {|s,|} diverges to infinity, hence {s,,} does not converge to a complex 
number. 


It turns out that there is an important connection between limsup, liminf, and sub- 
sequential limits: 
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Theorem 8.9.4. Let {s,,} be a bounded sequence of real numbers. Then the supremum 
of the set of all subsequential limits equals limsups,, and the infimum of the set of all 
subsequential limits equals lim inf s,,. 


Proof. Proof of the limsup part only: Let A = limsup{s,}, let S be the set of all subse- 
quential limits of {s,}, and let U = sup($). Since the sequence is bounded, A and U are 
real numbers. 

Let « > 0. Since A = inf{sup{s, :n > m}:m > 1}, there exists mp > 1 such that 
sup{s, :n > mo}—A<e. Thus for all n > mo, s, — A < €. But then any subsequential 
limit of {s,,} is a subsequential limit of {s,,}n>m,, 80 that this limit must be at most A+. 
Thus A+ is an upper bound on all subsequential limits of {s,,}, so that U < A+. Since 
€ is an arbitrary positive number, by Theorem 2.10.4 this means that U < A. 

By definition of U, there exists a convergent subsequence {s;,,} such that U — 
lim{sz, }n < €/2. Let DL = lim{s,,}n. SoU —L < e€/2, and there exists a positive 
real number N such that for all integers n > N, |s,,, — L| < €/2. Thus for alln > N, 


Sp, > L—€/2 > (U —e/2) —€/2 =U —€«. 


Thus for any integer m > N, the supremum of {s,, :n > m} must be at least U — €, so 
that A, the limsup of {s,} must be at least U — «. Hence by Theorem 2.10.4 this means 
that A < U. 

It follows that A = U. 


Theorem 8.9.5. Let {s,,}, {tn} be bounded sequences in R. Then limsup s,+limsupt, > 
lim sup(s, +t) and liminf s,, + liminft, < liminf(s, + t,). 


Proof. Let a = limsup sn, b = limsupt,, c = limsup(s,+t,). By boundedness, a, b,c € R. 


Let « > 0. Recall that a = inf{sup{s, :n > m}:m > 1}. Then there exists a positive 
integer m such that 


sup{sn:n>m}—e<a<sup{s,:n>m}. 


In particular for all n > m, s, —€ < a. By possibly increasing m, we similarly get that 
for all n > m in addition t, — e€ < 6. Thus for all n > m, 8, + ty — 2e€ < a+b. Thus 
c = inf{sup{sn +tn:n >m}:m> 1} <a+b4 2. Since « is arbitrary, it follows that 
c<a-+b, which proves the first part. The rest is left as an exercise. 
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Theorem 8.9.6. Let {s,} and {t,,} be sequences of non-negative real numbers such that 
lim s,, is a positive real number L. Then limsup(sntn) = Llimsupt, and liminf(s,tp,) = 
Liiminft,. 


Proof. Let « > 0. Set e’ = min{LZ/2,¢}. By assumption there exists N > 0 such that for all 
integers n > N, |s, — L| < e. Then L—e' < s, < L +e’. Thus each s, is positive, and in 
fact |s,| > L/2. It follows that (L —')tn < Sntp < (L+ €')tn. But then since D+’ > 0, 
(L —¢’) limsupt, = limsup(L — €’)tn 
< lim sup Sptn 
< limsup(L + €’)tn 
= (L +e’) limsuptn. 


The proof of the liminf part is similar. 


Exercises for Section 8.9 


8.9.1. Let {s,} and {t,} be sequences of real numbers such that lim s,, is a positive real 
number L. Prove that lim sup(s,+t,) = L+limsupt,, and liminf(s,+t,) = L+liminft,. 
8.9.2. Compute liminf and limsup for the following sequences. Justify your work. 
jee (n)} where f is a bounded function. 
ii) {(—1)"nl}. 
iii) {2-"}. 
iv) 13 9 Bt 8 1 SO 
v) {(442)(-1)"}. 
vi) The sequence of all positive prime numbers. 
vii) The sequence of all multiplicative inverses of positive prime numbers. 
8.9.3. Prove the following for any sequence {s,,}: 
i) inf(—s,) = —sup(s,). 
ii) lim inf(—s,,) = —limsup(s,). 
8.9.4. Suppose that {s,,} converges to L. Finish the proof of Theorem 8.9.2, namely prove 
that liminf s, = L. 
8.9.5. For every positive integer n, let so,-1 = 1 and sg, = 1/2”. 
i) Compute lim inf s,,, liminf s,. 
ii) Prove that limsup + > 1. 


iii) Does this contradict the ratio test as in Remark 8.9.3. 
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8.9.6. Find bounded real-valued sequences {s,,}, {t,} such that limsup s, + limsupt, > 
lim sup(s,y, + tn). (Compare with Theorem 8.9.5.) 
8.9.7. Let {s,}, {tn} be bounded sequences in R. 

i) Finish the proof of Theorem 8.9.5, namely prove that liminfs, + liminft, < 

lim inf (sp + tn). 

ii) Find such {s,}, {t,} so that liminf s, + liminft, < liminf(s, + t,). 
8.9.8. Suppose that lims, = oo. Prove that the set of subsequential limits of {s,,} is 
empty. 
8.9.9. Finish the proof of Theorem 8.9.4, namely prove that the infimum of the set of all 


subsequential limits of a bounded sequence equals the liminf of the sequence. 


1 


8.9.10. Let {s,,} be a sequence of positive real numbers. Prove that lim sup + = iminfar’ 


Chapter 9: Infinite series and power series 


In this section we introduce infinite sums, what it means for them to make sense, 
and we introduce functions that are infinite sums of higher and higher powers of a variable 
x. The work horse of infinite sums are the geometric series, and they are almost the only 
type of infinite sums that we can compute numerically. The more technical sections on 
differentiability of power series then allow us to compute many more infinite sums. 

Warning: Finite sums are possible by the field axioms, but infinite sums need not 
make any sense at all. For example, 


ee aes aD emer ey me ge yo fom gw eg ome ert Meee co ee re 


may be taken to be 0 or 1 depending on which consecutive pairs are grouped together in 
a sum, or it could even be summed to exactly 3 by taking the first three positive 1s, and 
then matching each successive —1 in the sum with the next not-yet-used +1. In this way 
each +1 in the expression is used exactly once, so that the sum can indeed be taken to be 
3. Similarly, we can make the limit be 4, —17, et cetera. 

This should convince you that in infinite sums the order of addition matters. For 
more on the order of addition, see Theorem 9.2.8 and Exercise 9.2.16. 


Infinite sums require special handling. Limits of sequences prepared the ground. 


9.1 Infinite series 


Definition 9.1.1. For an infinite sequence {a,,} of complex numbers, define the corre- 
sponding sequence of partial sums 


{a1, 1 + a2, 41 + a2 + 43,41 + a2 +a3 + a4,...}. 


We denote the nth term of this sequence 8s, = )*\;_,a%. The (infinite) series corre- 
sponding to the sequence {an} is \>7-, a, (whether this “infinite sum” makes sense or 
not). 


When the range of indices is clear, we write simply }°, az or >> ax. 
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Example 9.1.2. For the sequence {1}, the sequence of partial sums is {n}. If a 4 1, by 
Example 1.6.4 the sequence of partial sums of {a"} is {)0;_, a®}n = as “\.,. In par- 
ticular, the sequence of partial sums of {(—1)"} is {jc} = {-1,0, a —1,0,...}. 

We have encountered shifted sequences, such as {an }n>m, and similarly there are 
shifted series: 77°. a, stands for the limit of the sequence of partials sums, but in this 
case, the nth partial sum is 8, = @m + Qm41 + Am+2+°°* + Gm+tn-1- 


Definition 9.1.3. (Most of the time and by default we take m = 1.) The series \7 7, ax 
converges to L € C if the sequence {\~¢_""** ap}, converges to L. We say then that L 
is the sum of the series and we write )-7~,,, ax = L. 

If the series does not converge, it diverges. 


Just like for sequences, when a series diverges, it may diverge to oo or to —oo, or it 
may simply have no limit. 

Since a sequence {s,} converges if and only if {s, +c} converges (where c is any 
constant), it follows that )°7°., a, converges if and only if }°°°_, ax converges, and then 


Sak = A, + AQ +++ + Am—-1 + ys Ak. 
k=1 
The following follows immediately from the corresponding results for sequences: 


Theorem 9.1.4. Let A= °7, ax, B= op, bk, and c € C. 
(1) If A,B €C, then 


St ay +cby) = A+cB. 
k=1 


(2) If all ax, b,% are real numbers and A, B € RU {oo, —co}, then 


co, ifA=oo, BE RU{co} andc>0; 

co, ifA=oo, BE RU{-c} andc< 0; 

—oo, if A= -—oo, BE RU {-co} and c > 0; 
oo —oo, if A= —oo, BE RU {oo} andc < 0; 
So (an + cbe) = oO, if ACR, B=ow andc>0; 
k=1 —oo, ifAER, B=ow andc< 0; 

OO, if AER, B= —ow andc< 0; 

—oo, ifAER, B=—-o@ andc>0; 

A, fo=0, 


Proof. (1) By assumption, the sequences {)7/'_, ax} and {}77_, b,} converge to A and B 

in C, respectively. By the theorem on the convergence of sums of sequences (Theorem 8.4.3) 

then >7)_, (@x + chk) = 7p) Gk +¢>°,_, be converges to A+ cB. 
Other parts are proved similarly. 
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Remark 9.1.5. This theorem justifies the binary operation of addition on the set of of 


convergent infinite series: 


k=1 
It is hard to immediately present examples of this because we know so few limits of 


infinite series. There are examples in the exercises. 


Theorem 9.1.6. If r € C satisfies |r| < 1, then the geometric series )\7-, r*~! 
ke? 
~ J=r’ 


1 CO 
converges to 7—, So }i,_1T 


Proof. By Example 1.6.4, }>7_, pk-l — lr" | By Theorem 8.6.5, limr” = 0. Thus by the 


l-r 
scalar and sum rules for limits of sequences (Theorem 8.5.6 or Theorem 8.4.3), 


n 


7 n k : n k-1 _ _r 
Thus limy— soo be r= Himig 465 cs i “ J-r* 


In particular, the familiar decimal expansion 0.33333 --- of 1/3 can be thought of as 
the infinite sum $77", i: The sequence of its partial sums is {0.3, 0.33, 0.333, 0.3333, ...}, 
and by the theorem above, 


a. i ae ee es om ae 
10* 10 10 SMO. rl: $e) = 
k=1 k=1 10 
A little more work is expressing 5.523 = 5.523523523523 --- (repeating 523) as a fraction: 
sexs 523 523 523 = 1 
5.523 =5 +++ = 5 +523 —— 
~ 1000 - 10002 gs 10002 - ry = (am) 
1 
a i 523 = 4995 +. 523 
= 5 + 523—1000_ _ 5 4 593 _ = 5 _ 
= 1-4 a 1000 — 1 is 999 999 
_ 5518 
~ 999° 


Example 9.1.7. The harmonic series pear i diverges to co by Example 8.7.6. 


Example 9.1.8. )°7-, eee a 


Proof. By Exercise 8.2.9, >) _, RET =i 


n 
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Example 9.1.9. The series 0°, 7s converges. 


Proof #1: By Exercise 1.6.11, 


so that {)77_, atn is a bounded increasing sequence of real numbers. By Theorem 8.6.4, 


the sequence has a limit that is at most 2, so that 07°, ¢ converges. 
Proof #2: Since k? > k(k — 1), we have that fork > 2, & < sue Thus: 3S, 442°= 
nod n 1 . 00 1k ence 1 
1+ ane <1 + Doge RET): By Example 9.1.8, the series )7¢-9 AEA) = Lk REDD 
converges, so that the increasing sequence {)7;_, a} of partial sums is bounded above, so 
that by Theorem 8.6.4, 5°72, 7 converges. 


It turns out that S77", Zz = ues but this is harder to prove. (Three different proofs 
can be found in Exercises 10.5.1, 10.5.3 and 10.5.5.) 


Theorem 9.1.10. If 5°... a, converges, then lima, = 0, and the sequence {a,} is 
bounded. 


Proof. By assumption, }>7°, a, converges, so that by definition, the sequence {s,,} of 
partial sums converges, and is thus Cauchy. In particular, for every « > 0 there exists 
N > 0 such that for all integers n > N +1, |an| = |Sn — 5n-1| < €. Thus lima, = 0, and 
by Theorem 8.6.2, {a,,} is bounded. 


The converse of this theorem is of course false; see Example 9.1.7. 


Theorem 9.1.11. Let {a,} be a sequence of non-negative real numbers. If the sequence 
{a, + a2 +--+-+4a,} of partial sums is bounded above, then >> ay converges. 


Proof. The sequence {a1 + a2 +---+a,} of partial sums is monotone and bounded above, 


so it converges by Theorem 8.6.4. 


Theorem 9.1.12. Let {a,} be a sequence of complex numbers, and let m be a positive 
integer. Then )>7~_, a, converges if and only if )77~_,,, dn converges. Furthermore in this 
case, \ op On = (a1 +42 + +++ + m—1) + Rm Gn- 


Proof. Let sn, = a, +a@2+---+4@n, and th = Gm +@m41+--:+ ay. By the constant 
and sum rules for sequences (Theorem 8.5.6 or Theorem 8.4.3), the sequence {s,}, = 
{a1 + a2 +++: +@m-1}n + {tn}n converges if the sequence {t,,} converges, and similarly, 


{tn}n = {S8n}n — {a1 + a2 +--+ +Qm_—1i}n converges if {s,,} converges. 


Exercises for Section 9.1 
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9.1.1. Let r € C satisfy |r| > 1. Prove that }77°, r* diverges. 
9.1.2. Compute and justify the following sums: )7?°, 3c, cpg ae) Donce HE 
9.1.3. Prove that 77°, PoE? converges, and find the sum. (Hint: Do some initial 
experimentation with partial sums, find a pattern for partial sums, and prove the pattern 
with mathematical induction.) 
9.1.4. Let a, = (—1)”. Prove that the sequence of partial sums {a; + a2 +-:-+ ay} is 
bounded but does not converge. How does this not contradict Theorem 9.1.11? 
9.1.5. For each k € N* let x, be an integer between 0 and 9. 
i) Prove that }>;2, 7 converges. 
ii) What does this say about decimals? 
iii) Find the sum if x; = 4 for all k. Express the sum with its decimal expansion and 
also as a ratio of two positive integers. 
iv) Find the sum if {z,} = {1, 2,3, 1, 2,3,1,2,3,1,2,3,...}. Express the sum with its 
decimal expansion and also as a ratio of two positive integers. 
(v)* Prove that whenever the sequence {x,,} is eventually periodic, then }77-, qx is a 
rational number. 
9.1.6. Prove that }°7° , ax converges if and only if }°°°., Rea, and S°°°_, Ima, converge. 
Furthermore, )77-, dk = d-p-, Rear +i do 7-, Imag. 
9.1.7. Suppose that lima, 4 0. Prove that }°7°, ax diverges. 
9.1.8. Determine with proof which series converge. 
i) Dike BE 
ii) Donat (gs + tk). 
iii) pea a 
9.1.9. Let {a,} and {b,} be complex sequences, and let m € N* such that for all n > 1, 
An = bnim- Prove that 77°, a, converges if and only if }°;°_, b, converges. 


9.2 Convergence and divergence theorems for series 


Theorem 9.2.1. (Cauchy’s criterion for series) The infinite series }* a, converges if 
and only if for all real numbers € > 0 there exists a real number N > 0 such that for all 
integersn >m> WN, 

lQm41 + Am4+2+°-°: + ay| <€. 


Proof. Suppose that 5> a, converges. This means that the sequence {s,,} of partial sums 
converges, and by Theorem 8.7.3 this means that {s,,} is a Cauchy sequence. Thus for all 
€ > 0 there exists N > 0 such that for all integers m,n > N, |Sn — Sm| < €. In particular 
forn >m>N, \ami1 + Am42 +++: +4n| = |Sn — Sm| <€. 
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Now suppose that for all real numbers € > 0 there exists a real number N > 0 such 
that for all integers n > m > N, |am4i + Gm42 +++: +@n| < ¢€. This means that the 
sequence {s,,} of partial sums is Cauchy. By Theorem 8.7.5, {s,,} is convergent. Then by 


the definition of series, )°, a; converges. 


Theorem 9.2.2. (Comparison test (for series)) Let {a,} be a real and {b,} a complex 
sequence. such that for all n, an > |bn|- 

(1) If \° a, converges then )*> b;, converges. 

(2) If \> by diverges then )~* ax diverges. 


Proof. Note that all a, are non-negative, and for all integers n > m, 
om41 + Om42 ++++ + bn| = |Om+1| a |Pm+2| Sa [On| 
< Qm4+1 + Gmt2 +++: + Gn 
= lQm41 + Am+2 Spee Bal 


If }> ax converges, by Theorem 9.2.1, for every € > 0 there exists N > 0 such that for all 
integers n >m > N, |Q@m4i1+@m+42+-::+a@n| < €, and hence |by,41+ bm42+---+bn| <e. 
Thus again by Theorem 9.2.1, 5) 6b; converges. This proves the first part. 


The second part is the converse of the first. 


Theorem 9.2.3. (Ratio test) (Compare to Remark 8.9.3.) Let {a,} be a sequence of 
non-zero complex numbers. 
(1) If limsup |“**) < 1, then >? |ax| and 7 ax converge. 


(2) If lim inf 


> 1, then S> |a,| and S> a, diverge. 


Qn4+1 
an 


Proof. Let L = lim sup | "+" |. Suppose that L < 1. Let r be a real number in the 
open interval (L,1). Since L = inf {sup{ n+! :n>m}:m> 1} andr > £, it fol- 
lows that there exists m > 1 such that r > sup{ fat! :n > m}. Thus for all n > m, 
l@n4il < T|@n|. Thus by Exercise 1.6.22, |@min| < 7T”|@m|. The geometric series >, r” 
converges by Theorem 9.1.6, and by Theorem 9.1.4, 5°, amr* converges. Thus by Theo- 
rem 9.2.2, 07°, |@m+r| and 77°, Gm4% converge. Hence by Theorem 9.1.12, }> |a,| and 
> az converge. This proves (1). 

Now let L = liminf |“**|, and suppose that L > 1. Let r be a real number in the 


an 


open interval (1, LZ). Since L = sup {inf {al :n>m}:m>1} andr < L, it follows that 


there exists m > 1 such that r < inf{ lantitl 7 > m}. Thus for all n > m, |an4i| > rlan|. 


|an| 


Thus by a straightforward modification of Exercise 1.6.22, |am4n| > r”|am|. The geometric 


series }>, r* diverges by Exercise 9.1.1, and by Theorem 9.1.4, 5°), @mr* diverges. Thus by 
Theorem 9.2.2, 77°, |@m+s| and S772, m+ diverge. Hence by Theorem 9.1.12, >> |a,| 
and )> a, diverge. This proves (2). 
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This ratio test for convergence of series does not apply when lim sup | aa =lor 
lim inf |“2**| = 1. The reason is that under these assumptions the series }>,,|a,| and 


>>, ak Sometimes converge and sometimes diverge. For example, if a, = 1/n for all n, 


lim sup |““++| = lim inf |“*++| = 1, and $*?_. 1 diverges; whereas if a, = 1/n? for all n 
: An ’ k=1k ’ ’ 


an 


then lim sup |““++| = lim inf |““*+| = 1, and 3*”_, 4 converges. 
An An ’ k=1k 


Theorem 9.2.4. (Root test for series) Let {a,,} be a sequence of complex numbers. 
Let L = limsup|a,|1/”. 
(1) If L <1, then )¢, |ax|, >0, ax converge. 


(2) If L > 1, then $°, |ax|, >5, a% diverge. 


Proof. If L < 1, choose r € (L,1). Since L = inf{sup{|a,|/":n >m}:m>1} andr > L, 
there exists m > 1 such that r > sup{|a,|!/" : n > m}. Thus for all n > m, r” > |an|. 
Thus by the Comparison test (Theorem 9.2.2), since the geometric series > r* converges, 


we have that 5° a, and >> |a,| converge. The proof of (2) is similar, and is omitted here. 


Theorem 9.2.5. (Alternating series test) If {a,,} is a non-increasing sequence of pos- 
itive real numbers such that lima, = 0. Then )>7-_,(—1)*ax converges. 


Proof. Let m,n be positive integers. Then 


0 & (Ga, —_ Gn+41) + (Qn42 — An+3) a es (Qntom = On+2m+1) 


= An — Anti + An+2 — An+3 2 An+2m — An+2m+1 


= An— (Qn41 —An+42) = (An43 _ An+4) ares (Cee, perme —An+2m) —An+2m+1 
< Gn, 


and similarly 
OS Gy Op eo = Ops Foe Oya = On. 


Thus by Cauchy’s criterion Theorem 9.2.1, }>,(—1)"a, converges. 


Example 9.2.6. Recall from Example 9.1.7 that the harmonic series }°, 1/k diverges. 
But the alternating series }>,(—1)"/k converges by this theorem. (In fact, }>,(—1)*/k 
converges to —1n2, but proving the limit is harder — see the proof after Example 9.7.7. 
We examine this infinite series more carefully: 
Hn Ty oe. Ot ee hth SY 


1 
= 3° 4 56 778 9 * 10 


We cannot rearrange the terms in this series as (5 + 7 + t + i + in +--+) minus 
el a 5 ee z F 5 + 5 ee te -) because both of these series diverge to infinity. More on 
changing the order of summation is in Exercise 9.2.16 and in Theorem 9.2.8. 
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Definition 9.2.7. A series )>7° , ay, is called absolutely convergent if S>7~, |a,| con- 


verges. 


Theorem 9.2.8. Let )°)~, a, be absolutely convergent. Let r : Nt > Nt be a bijective 
function. Then > 7", a(x) converges. 


Proof. By comparison test (Theorem 9.2.2), )77°., a, converges to some number L € C. 
We will prove that }°7°, Ar(k) converges to L. 

Let « > 0. Since )77°., ax = L, there exists No > 0 such that for all integers n > No, 
[yop @k — L| < €/2. Since °°", |ax| converges, by Cauchy’s criterion for sequences there 
exists N; > 0 such that for all integers n > m > M1, Wypimy1 lanl < €/2. Pick an 
integer N > max{No, Ni}. Since r is a bijective and hence an invertible function, we can 
define M = max{r~'(1),r~1(2),...,r7'(N)}. Let n be an integer strictly bigger than M. 
Then by definition the set {r(1),7r(2),...,r(m)} contains 1,2,...,N. Let K = {r(k):k < 
n}\ {1,2,...,N}. Then 


n N 
So arn) — £ Sax — L 
k=1 k=1 


since all the finitely many indices in K are strictly bigger than No. 


= < 


+ S° lax] <e 


kek 


N 
Slant Sa, - L 
k=1 


kek 


Theorem 9.2.9. (Integral test for series convergence) Let f : [1,00) — |0,00) bea 
decreasing function. Suppose that for all n € N*, Jy f exists. Then )°7-, f(k) converges 
if and only if limy_+6 i f exists and is a real number. 

(It is not necessarily the case that \P-_, f(k) equals limn.o0 f7" f.) 


Proof. Since f is decreasing, for all x € [n,n+ 1], f(n) > f(x) > f(n+1). Thus 
n+1 n+1 n+1 


f(nt))= i f(n+ de < [ f(a)dx < / f(n)dx = f(n). 


Suppose that >°, f(k) converges. Then by the definition this means that 
limnsoo(f(1) + f(2) +--- + f(n)) exists. By the displayed inequalities, ne = 
eo + Ee ap each ef ieee < f(1)+ f(2)+--- + f(n), so that 0 hae f}n is a bounded 
increasing sequence of real numbers, so that limn+. the _ f exists, and hence that 
limi a3 if f exists. 

Conversely, suppose that limps is f exists. Let ZL € R be this limit. By the 
displayed inequalities, f(2) + f(3) +---+ f(n+1) < i foot (a f= ie f. Since 
f takes on only non-negative values, this says that f(2) + f(3) +---+ f(n+1) < L. Thus 
{f(2) +---+ f(n+1)}, is a non-decreasing sequence that is bounded above by L. Thus 
by Theorem 8.6.4, this sequence converges. By adding the constant f(1), the sequence 
{f(1)+---+ f(n+1)}n converges, so that by the definition of series, )°, f(k) converges. 
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Theorem 9.2.10. (The p-series convergence test) Let p be a real number. The series 
>>, k? converges if p < —1 and diverges if p > —1. 


Proof. If p = —1, then the series is the harmonic series and hence diverges. If p > —1, 
then n? > n7~! for all n by Theorem 7.6.5. Thus by the comparison test (Theorem 9.2.2), 
>>, k? diverges. 

Now suppose that p < —1. The function f : [l,oo) > R given by f(x) = 2? is 
differentiable, continuous, and decreasing. Since f is continuous, for all positive integers n, 
fi’ f exists. By the Fundamental theorem of calculus, [;" f = f/' ? dx = a —. By the 
composite rule for sequences (either Theorem 8.5.8 or Theorem 8.4.7), slice the function 


that exponentiates by the positive —(p+1) is continuous at all real numbers and lim 1 =, 
it jue that limn?+! = lim (4) ~ Or) = pera 0, so that lim J," f exists and equals 
ae Thus by the Integral test (Theorem 9.2.9), 5°, k? converges. 


Exercises for Section 9.2 


(-1)* 


\f 
9.2.1. Prove that yo @EpF Loko OEHY rea w and DR, Ge alll converge. 
9.2.2. Prove that the following statements are equivalent. (They are also all true, but we 
do not yet have enough methods to prove them.) 
oo 1 1 
(1) Doe-o GEED? = S- 
ioe) 1? 
(2) dept e = igs 
Seay. mn 
(3) Veo caer =. Ge 
(4) oes ae = 7 
(Hint for a part: Write out the first few summands of >>, cy 5 ope ee 
oe) —1)* 
-_ k=l ce + 2 kel aE) 
9.2.3. For each of the following series, determine with proof whether they converge or 
diverge. You may need to use Examples 8.2.9 and 8.3.6. 


3t 
i) : —_—_. 
oo VR 
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v) ‘e: 
k=1 
: 1 
vi —. 
) 2p 
9.2.4. Find a convergent series )>, a, and a divergent series )~,, by with limsup |a,|!/" = 
lim sup |b, |1/" = 1. 
9.2.5. Make a list of all encountered criteria of convergence for series. 


9.2.6. The goal of this exercise is to show that if the ratio test (Theorem 9.2.3) determines 
the convergence/divergence of a series, then the root test (Theorem 9.2.4) determines it as 
well. Let {a,,} be a sequence of non-zero complex numbers. 

i) Suppose that lim sup | <1. Prove that lim sup |a,|!/" < 1. 

ii) Suppose that lim inf |“}*+] > 1. Prove that lim sup lange SA, 


9.2.7. Apply the ratio test (Theorem 9.2.3) and the root test (Theorem 9.2.4) to 77", a 
Was one test easier? Repeat for 77°, 7. 

9.2.8. Let {a,,} be a complex sequence, and let c € C. Is it true that }>7°_, a, converges 
if and only if }°7°., ca, converges? If true, prove; if false, give a counterexample. 

9.2.9. Let a, = (—1)"/n, b, = 2(-1)”. Prove that }°7°., a, converges, that for all n, 
Jbn| = 2, and that )7;", ¢* diverges. 

9.2.10. (Compare with Exercise 9.2.9.) Suppose that 5°7~ , |ax| converges. Let {b,} be a 
sequence of complex numbers such that for all n, |b,| > 1. Prove that )77° , i converges. 


9.2.11. (Summation by parts) Let {a,,}, {b,} be complex sequences. Prove that 


S Wea Wo Gene e 
k=1 k=1 k=1 j=l 
(Hint: Set ag = bo = 0. Let f,g : [1,00) - C be defined as follows: for each n € N*, on the 
interval [n,n+1), f is the the constant function a, —a,_, and g is the constant function b,. 
Both f and g are piecewise continuous, F'(n) = ads , =O, C= ae G= Ds 3 Vie 
The problem should remind you of integration by parts [(F'g) = FG — |(fG).) 
9.2.12. Let x € C with |z| < 1 and x # 1. Prove that >”, a converges. (Hint: 
Exercise 9.2.11, Example 1.6.4.) 
9.2.13. Prove that if 77°, a, converges then )°>;°, 5* converges. (Hint: Exercise 9.2.11.) 
9.2.14. (The Dirichlet test) Let {a,,} be a decreasing real sequence such that lima, = 0, 
and let {b,,} be a sequence of complex numbers whose sequence of partial sums is bounded. 
i) Prove that the series )°7° , a,b, converges. (Hint: Exercise 9.2.11.) 
ii) Prove the alternating series test using part i). 
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9.2.15. The following guides through another proof of the p-series convergence test for 
p< —1. (Confer Theorem 9.2.10 for the first proof). 
i) Prove that for each positive integer n there exists c € (n,n+1) such that (p+1)c? = 
(n+ 1)?*+ —nP*!. (Hint: Mean value theorem (Theorem 6.3.4).) 
ii) Prove that for all positive integers n, 
(n+ Lert — nptt ee mpti — (n— Ler 


n+1)jP< 
( ) p+l1 p+l1 


iii) Prove by induction on n > 1 that 
eee 
| p+l1 p+l1 


iv) Prove that the positive sequence {)7/_, k?}» of partial sums is an increasing se- 
quence bounded above. 


v) Prove that the sequence {7/_, k?},, converges, and so that 577°, k? converges. 


9.2.16. (Order of summation in infinite sums is important.) 


i) Prove that S772, s and OP., a44 diverge. Refer to Example 9.2.6. 


k 
ii) Observe that )7y_, x =>) y o = ae (1) . 
k 
iii) Argue that S77, Ch) oO ae a SIGE Wy daca eie ueecontaties 


the “expected” summation and difference rules? 


9.2.17. (Raabe’s test) Let a), a2,... be positive real numbers such that for some a > 1 
and for some N € N, — <1- © for alln > N. Prove that }/,, an converges. (Hint: 
Let f(x) = 1— 2°. Use the Mean value theorem to get c € (x, 1) such that f’(c)(1 — 2) = 
f(1) — f(z). Conclude that 1— 2° < (1—2)a. Apply this to =1— +. Use that >n~° 
converges. ) 


9.3. Power series 


In this section we deal with sums where the index varies through No, and furthermore, 
the terms of the sequence are special functions rather than constants: 


Definition 9.3.1. A power series is an infinite series of the form 


CO 
S a,e* = y apt” = ag + aye + agen” + age? +---, 
k=0 k>0 


where ao, @1,@2,... are fixed complex numbers, and x is a variable that can be replaced by 
any complex number. (By convention as on page 34, 0° = 1.) 
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The table below identifies the coefficients a, of x” in several powers series. 


power series An 
d= 1 
k=0 
SoHo t art ah tay... 1, if n is odd; 
ner 0, if n is even. 
0° 0, if n is odd; 
Slat Bog t git gl ei84. - 0, if nis even and n < 12; 
k=0 1, if n> 12 is even. 
ye! ; 
k=0 
— n 
! n 
= k! 
> (ha)? nr 
k=0 


The partial sums of power series are polynomials, so a power series is a limit of 
polynomials. 

A power series is a function of x, and the domain is to be determined. Clearly, 0 
is in the domain of every power series: plugging in x = 0 returns oaae a,0* = ao. If all 
except finitely many a, are 0, then the power series is actually a polynomial, and is thus 
defined on all of C. When 1 is in the domain, evaluation of the power series )*, a,x* at 
x = 1 is the (ordinary) series 5°, ax. 

The most important question that we address in this section is: which x are in the 
domain of the power series, i.e., for which x does such an infinite series converge. We prove 
that for every power series whose domain is not all of C there exists a non-negative real 
number R such that the series converges for all z € C with |z| < R and the series diverges 
for all x € C with |x| > R. What happens at x with |z| = R depends on the series. 


Example 9.3.2. Let f(r) = °?2.) 2%. By Theorem 9.1.6, the domain of f contains all 
complex numbers with absolute value strictly smaller than 1, and by Theorem 9.1.10, the 
domain of f contains no other numbers, so that the domain equals {x € C: |z| < 1}. 
Moreover, for all x in the domain of f, by Theorem 9.1.6, S>;".5 a= x. Note that the 
domain of ok is strictly larger than the domain of f. 

Whereas for general power series it is impossible to get a true numerical infinite sum, 
for geometric series this is easy: f($) = RDO = =e =2, f() = e>o = = = 3, 


(0.6) = ys 0.6" = pg = 2.5. 
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Theorem 9.3.3. (Root test for the convergence of power series) Let S> a,x" be a 
power series, and let a = limsup |a,,|!/". Define R by 
1/a; if 0 <a<00; 
R= 4 0, ifa=o; 
oo, dia: = 0, 
Then for all x € C with |x| < R, Y>|az||z|* and $+ azx* converge in C, and for all x € C 
with |x| > R, > |ax||2|* and S* axx* diverge. 


Proof. By the definition of limits, a is either a non-negative real number or oo. We apply 
the Root test for series (Theorem 9.2.4): limsup jay,2”|!/” = |a|limsup |a,|!/" = |ala. If 
|z|a <1, then both of the series converge, and if |z|a > 1, then the two series diverge. If 
a = 0, then |z|a < 1 is true for all x € C, so R = oo has the stated property. If a = on, 
then |x|a < 1 is true only for x = 0, so R = 0 has the stated property. If 0 < a < co, then 


|x|a < 1 is true only for all z € C with |z| < 1/a= R. 


Definition 9.3.4. The R from Theorem 9.3.3 is called the radius of convergence of 


the series )> apa”. 


This is really a radius of convergence because inside the circle B(0,R) the series 
converges and outside of the circle the series diverges. Whether the power series converges 
at points on the circle depends on the series; see Example 9.3.6. 


Theorem 9.3.5. The series Sl apx*, Y > laz|a*, >> Jag||a|*®, S > kap|a|*, SY kagae®-t, 
Sk apa”, S>k(k — 1)azx*, all have the same radius of convergence. 


Proof. By Example 8.2.9, lim %/|n| = 1. For any integer n > 2,1< Wn—1< Yn, so that 
by the squeeze theorem, lim Yn — 1 = 1. Thus by Theorem 8.9.6, limsup %/|n(n — 1)an| = 


lim 7~/n(n — 1)-limsup *%/|a,| = limsup ~/|a,|. This proves that the a as in Theorem 9.3.3 
for )\ axa" is the same as the a for > k(k — 1)axx*, which proves that these two power 


series have the same radius of convergence. The proofs of the other parts are similar. 


Example 9.3.6. We have seen that )~ x" has radius of convergence 1. By Theorem 9.1.10, 
this series does not converge at any point on the unit circle. By the previous theorem, 


k is also 1. By the p-series test 


the radius of convergence of )77°., ¢2" and of PY. , fa 
(Theorem 9.2.10) or by the harmonic series fact, 0°, <2" diverges at « = 1, and by the 
alternating series test (Theorem 9.2.5), 77°, ea. converges at x = —1. By the p-series 
test (Theorem 9.2.10), 77°, 7 converges, so that by the comparison test Theorem 9.2.2, 


yp, ze converges on the unit circle. 
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Theorem 9.3.7. Let the radius of convergence for )>a,7* be R. Then the radius of 
convergence for )> apx2® is co if R= o and it is VR otherwise. 


Proof. Let a = limsup (~/|a,,|. In the second power series, the nth coefficient is 0 if n is 


odd, and it is a,/22" if n is even. Then applying the root test to this power series gives 


lim sup { v0, Vail, V0, V |aal, v0, Vv 
= limsup { “jail, /|a, /\asl, Vlas 


(Os do not contribute to 


as3|, V0, Vaal, o | 


us 


limsup of non-negative numbers) 


= lim sup {VV V jaal, / Vlas 


V Viai....} 


= s[limsup { fark Taal. Vaal eal ...} 
Zafe 


and the conclusion follows. 


Example 9.3.8. Similarly to the last example and by Theorem 9.3.5, Sa", So ke*, 
\ k2a*, Sk(k —1)2", D> x?**1 all have radius of convergence 1. 


Theorem 9.3.9. (Ratio test for the convergence of power series) Suppose that all 


Gy, are non-zero complex numbers. 


(1) If |x| < liminf| |, then 37 |ax||a|* and $7 axa* converge. 
: an k k aw 
(2). If |) >> lim sup| a7 |) then Y= |ax||z|* and $5 a,x” diverge. 


Thus if lim| ot : | exists, it equals the radius of convergence of S~ agx*. 


Warning: Compare with the Ratio test for convergence of series (Theorem 9.2.3) 
where fractions are different. Explain to yourself why that is necessarily so, possibly after 
going through the proof below. 


Proof. The two series converge in case x = 0, so that we may assume that x #0. We may 
then apply the Ratio test for convergence of series (Theorem 9.2.3): 


n+1 a 
Himsup | = 12 im sup ss 
An x” n 
as int{sup {| nz mh a {ee 
. 1 
=l¢\int< - = ied 
inf {| “",|:n > m} 


[2 


sup {inf {| =| coh eal ch aren fe i} 
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[a 
an 
dni 


~ jiminf | 
If this is strictly smaller than 1, then the two series converge, which proves (1). Similarly, 


Cera || 


lim inf 


An x” 


~ lim sup| a | 


and if this is strictly larger than 1, then the two series diverge. The last part is then 


immediate by the definition of radius of convergence. 


Examples 9.3.10. 

(1) In Example 9.3.8 we established via the root test that )>a?*t! has radius of 
convergence 1. The ratio test is inapplicable for this power series. However, note 
that S> 2?**+1 = x57 (ax?)*, and by the ratio test for series (not power series), this 
series converges for non-zero x if limsup |(a?)**1/(ax?)*| < 1, ie., if |z?| < 1, ie, 
if |z| < 1, and it diverges if |x| > 1. 

(2) The radius of convergence of 77°, er is oo. For this we apply the root test: 
a = limsup [agen — lim sup + = 0. 

(3) By the ratio test, the radius of convergence of }>7"_, = is lim| == | = lim (nti! = 


lim(n +1) = oo. The root test gives a = lim sup |1/n!|!/"| = lim sup(1/n!)!/”, and 


by Example 8.3.6 this is 0. Thus the radius of convergence is co also by the root 
test. 


Exercises for Section 9.3 


9.3.1. Let the radius of convergence of )77° a,x" be R. Let c€ C. 
i) Prove that c- 772.9 anx* = TP (c-ax)x* is convergent with radius of convergence 
equal to R if c#0, and with radius of convergence oo otherwise. 
ii) Prove that )7?°., a,(cx)* is convergent with radius of convergence equal to oo if 
c= 0 and radius R/|c| otherwise. 


9.3.2. Suppose that >°, a, converges for some a,x € C. 
i) Prove that the function defined as the power series )>,, a,x* has radius of conver- 
gence at least 1. 
ii) Give an example of a, € C for which the radius of convergence of 77°, axx* is 
strictly greater than 1. (Bonus points for easiest example.) 
iii) Give an example of a, € C for which the radius of convergence of 7°, apx™ is 
equal to 1. 
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9.3.3. Compare with the previous exercise: find the radii of convergence of the power series 
a aol 0 2) ae eo I ee (28) 
9.3.4. cpa and justify the radius of convergence for the following series: 


1 


1M | 


ist 


ery 


xf. 


i) 
i) (3 
iti) > 3kx". 
iv) DU k(3 a)*. 
Ds a 
vi) 3 
vii) ee : 


9.3.5. Compute and justify the radius of convergence for the following series: 
i) 
ii) 

iii) se ca 
9.3.6. Let ° pape” and )~7°_, byx® be convergent power series with radii of convergence 

k=0 k=0 

R, and Rg, respectively. Let R = min{R,,R2}. Prove that yo, anx*® + Pg bpe® = 
y pp(@x+bi.) x" is convergent with radius of convergence at least R. (Hint: Theorem 9.1.4.) 


Po 


8 


oe 


ate 
7 


9.3.7. Let R be the radius of convergence of }7 7°, a,x". Let p be a positive integer. 
i) Determine the radius of convergence of payee ape, 
ii) Determine the radius of convergence of es apuP®. 


9.3.8. What would be a sensible definition for generalized power series 77°) ax(x—a)*? 
What would be a sensible definition of the radius of convergence of )77°_) ax(w—a)*? Draw 
a relevant picture in C. 


9.4 Differentiation of power series 


Power series are functions. In this section we prove that they are differentiable at all 
x inside the circle of convergence. Since a differentiable function is continuous, it follows 
that a power series is continuous inside the circle of convergence. 
Recall that for any differentiable function f, 
Gh) fe 
f(a) = Yim LEI) 


h—0 h . 


and any power series )>7° 9 a2" is the limit of a sequence: )>7°_, anv* = lim{}>)_, anv™ Fn. 
Thus 


_. limf{*?_, ag(a+ h)*}y — lim 8 age hy 
(So axa’)! = lim epee. a ( ) } {>= 6 k } 
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Certainly by the sum rule for convergent series, lim{}7/_) ax(t+h)*}n—lim{> > pg an2" }n 
lim{)> yp @x((a + h)* — x*)},, and by the constant rule we get that 


~ ky) any 
oF ay = = fi tim aE 


If we could change the order of limits, then we would get by the polynomial rule for 


derivatives that 
CO n 
(S a,x") = lim lim ; kaya*—1 
noo h->0 
k=0 k=0 


In fact, it turns out that this is the correct derivative, but our reasoning above was based 
on an unproven (and generally false) switch of the two limits. 

We give a correct proof of derivatives in the rest of the section. By Theorem 9.3.5 
we already know that the series 07°, anv" and )77°., kagx*®—! = S77°., kagx*! have the 
same radius of convergence. 

The following theorem is not necessarily interesting in its own right, but it is a 
stepping stone in the proof of derivatives of power series. 


Theorem 9.4.1. Let >, a,x" have radius convergence R. Let c € C satisfy |c| < R. Then 
the function g(x) = DvP, an(z*-1 + ca®-? 4 cPak-3 + --- + ck!) is defined on B(0, R) 
and is continuous at c. 


Proof. There is nothing to prove if R = 0, so we may assume that R > 0. 

First of all, S02, kayx*~1 converges on B(0,R) by Theorem 9.3.5. In particular, 
ope kanck) = SOP, ag (8! + eck? + c2ck-3 +... 4+ ch) is well-defined, but this is 
simply g(c), so that c is in the domain of g. 

Let € > 0 and let d € R satisfy |c| < d < R. Then again by Theorem 9.3.5, d is in the 
domain of >, azz", S>, kaya*—!, and also 37, k(k — 1)aya*~? and S>, k(k —1)|ax||a|*~?. 
Set D= $>>, k(k — 1)|az|d*~?, and 6 = min{d — |c|, pri}. Then 4 is positive. Let ce C 
with 0 < |x —c| < 6. Then by the triangle inequality, |a| = ja —c+c| < |ja—c|+4+ |e] < 
6+ |c| <d-—|c|+|c| =d < R, so that x is in the domain of the power series )>,, a,x". 

But is x in the domain of g? Since the radius of convergence of }°, a,x* is R, by 
Theorem 9.3.5 also the radius of convergence of 5>, kayx*—! is R, so that >>, kaxd*~1 


converges. Then from the Comparison theorem (Theorem 9.2.2) and from 


lay(a*—) + ce? + gh 3 +e cht) 
< |ax|(|a|*~* + lel|a|*-? + Je|?|x|*-? +--+ + Jel“) 
(by the triangle inequality) 
< jag|(d*—? + dd®-? + d?d®-3 +... + d’-") 
= kla,|d*-1 
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we deduce that x is in the domain of g. Furthermore, 


Ig(x) — gle)| 

= So an (a*? en 2 tg h 3 4s ght) — Os kapc*—*| 
=|Soax(a** + ca*? + Pak 3 4... 4 c8 1 — ke* 4) 

_ S- ag (ak? — 1) + (cg*-? — 1) 

+ (2ak-3 — oh-l) 4... 4 (eR PY 

= So ag (ak? — 1) + (gt? — c#-2) 

+ c?(a*-3 — c&-9) +... +. c% * (x — 0))| 

= So ax(x —c)((a*®~? + a3 e+ ch -4C? +... + ch?) 


+ c(x*-? + 2* 464 gh 9c? +... + eF3) 
+c? (ak4 + gk Pe 4 gh 82 +... +8 4) +--+ 8?) 
(by Exercise 1.6.21) 
Y_ lavlle — cl((la|*-? + |x| Fc] + [al* A |e)? + --- + [el*-?) 
+ |el(|a|*~? + jx| Ae] + fal? |el? +--+ + fel*~*) 


lela =a ele Pele fal Sle ater eel) seen ect lel?) 


IA 


(by the triangle inequality) 
< |x —c| S— |ag|((d*-? + d* 3d + dh 4a? +++. + d*-*) 
pad? aed ded’ Pd? each?) 
+ d?(d*-* +. dd d? Sa? 43-24 d* 4) ++. 4+d*-?) 


= |x —c| — |ax|((k — 1)d*-? + (k — 2)d*-7 +-(k — 3)d*-? +---4.*-?) 
= |x —c| Ss Pate Ye ? (by Example 1.6.1) 

= |x —c|D 

< 6D 

<.€. 


This proves that g is continuous at c. 
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Theorem 9.4.2. Let f(x) = >?) a2" have radius of convergence R. Then f is differen- 
tiable on B(0, R) and f'(x) = 7p) kaga’—? = SOP, kayx*+. The radius of convergence 
of f’ equals R. 


Proof. Let c,x € B(0,R). Then 
f(x) -— Ff) _ Lin ana* — Diy anc 
£-C L-—C 

= a On mt ce) 

7 £-C 

— yan (@ — 0) (a*? + w* 20 + wh 3c? +--+ 81) 

E-C 
(by Exercise 1.6.21) 


= Cae gh gh et pee), 
k 


which is the function g from the previous theorem. In that theorem we proved that g is 
continuous at c, so that 


f'(c) = lim F(a) = Ke) = lim g(x) = g(c) = ye 
k=1 


LC x-C LC 


Then the theorem follows from Theorem 9.3.5. 


Theorem 9.4.3. Suppose that the power series )>7~ ¢ a,x" has radius of convergence R 
and that the power series >; 9 b,2* has radius of convergence S. Suppose that for some 
positive r < R,S the two power series take on the same values at all z € B,(0). Then for 
alln > 0, an = by. 

In other words, if for all x € B,(0), S7P-9 anx” = 77 bex*, then the two series are 


identical. 


Proof. Let f(x) = 7p.9 agx* and g(x) = YP, bea”. By assumption ao = f(0) = g(0) 
bo. Let n € NT. By Theorem 9.4.2, f and g are differentiable on B,(0), and since they 
are identical, their nth derivatives agree on B,.(0) as well. By an n-fold application of the 


derivative, 


f(z) = s apk(k —1)(k —2)---(kK-—n+1)x*-”, 
k=n 


and similarly for g). But then 
ann(n — 1)(n —2)-+-(n—n+1) = f™(0) 
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so that a, = by. 


Exercises for Section 9.4 


9.4.1. Consider the function f(x) = ;— and its corresponding geometric series on B(0, 1). 
i) Compute a derivative of f as a rational function and as a power series. 


ii) Compute an antiderivative of f as a rational function and as a power series. 
g2ktt 


9.4.2. Let nace i ys 0 @kEDE!: 


i) Find the radius of convergence of the series. 

ii) Compute the derivative of f. 

iii) By Example 9.7.5 we know that for every real x, e” = 07°, ae Find the series 
for e(@”), 

iv) Find a power series whose derivative is e(@), (Comment: a” might stand for (a?)° 
or for a"). But the first form is simply a?°, so we would not write a” to stand for 
that. So, it is standard notation to write a when we mean a‘), In particular, 
e(@”) can be written more plainly as e*) 


v) Address the no-closed-form discussion on page 259. 


{9.4.3. (Invoked in Theorem 9.7.3.) Let f(x) = 772, anx* be a power series. Prove that 
for alln € No, 


{Mca = Daukth —1)(K-2)---(kK-n+4+1)a = Saga ae 


9.5 Numerical evaluations of some series 


Differentiation of power series is a ee tool. For all complex numbers x € - (01) 
the geometric series rie p= gk converges to z=; . Certainly it is easier to compute 7—, ; than 
the infinite sum. We can exploit geometric series and derivatives of power series to compute 
many other infinite sums. Below we provide a few illustrations of the method. 


Example 9.5.1. )7?°, sé = 4. 


Proof. Let f(x ) = =P, 2. This is the geometric series with radius of convergence 1 that 
converges to =~ (Example 9.3.2). By Theorem 9.4.2, f’(z) = 0p.) ka*~ + = SOP, ka*® 

and by ihesean 9.3.5, the radius of convergence of f’ is also 1. Thus $ is in the domain 
of f’. Since we have two ways of expressing f (as power series and as a rational function), 
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there is similarly also a second form for f’: 


re) (S5) - a 


From the two forms we deduce that 7-2-9 34+ = op k(4)*~ a7 2) = aay =A. 


Example 9.5.2. )°7°, = 6, and )7y~0 sa = 12. 


Proof. As in the previous example we start with the geometric series f(x) = 0P2,x* 
that converges on B(0,1). Its derivative f’(z) = Py ka*-! = ae also converges 
on B(0,1). Then xf'(x) = P.,kx*® and its derivative (xf’(x))’ = 72 k?2*! also 
converge on B(0,1). From 


= DAL, a2 Hoseth Ni is a, ee 
Dia! = tor = (an) = Toa 


we deduce that ‘ree = = (ath) = = a = = 12, and Oy 0 a 6. 


Example 9.5.3. From the previous example we know that }>7° 5 keg ae Mul- 


2k _ ata? 
k°x ~ (l= xz)? 


er Cer eaten 


It follows that S77" 5 3h = La pete = 52 and ))p-9 fe = 26. 


Summary: doko FT, = 2, ae BSD, ye = 26:18 it 
possible to predict peo at Rat! 


and differentiation gives 


tiplying both sides by x gives }7;° 5 


Example 9.5.4. 0), gr = In2. 


Proof. Let f(x) = 77), ane radius of convergence of f is 1, so 1/2 is in its domain. 
Also, f(x) = 372, a8! = tk, so that f(x) = —In(1— x) + C for some constant C. In 
particular, C = —-0+-C = —In(1—0)+C = f(0) = 0, so that (Ry oe = Ra me = 
f($) =—In(1 - §) = —In(§) = In(2). 


Exercises for Section 9.5 


9.5.1. Compute and justify 722, gr- 
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9.5.2. Consider 77°, SF According to my computer the partial sum of the first 1000 
terms is a rational number whose numerator and denominator take several screen pages, 
so this sum as a rational number is hard to comprehend. So instead I computed curtailed 
decimal expansions for this and for a few other sums: according to my computer, the 
partial sum of the first 10 terms is about 0.719474635555091, the partial sum of the first 
100 terms is about 0.719487336054311, the partial sum of the first 1000 terms is about 
0.719487336054311. What can you suspect? How would you go about proving it? 


9.6 Some technical aspects of power series 


The first theorem in this section is about products of power series and the second is 
about the convergence of a power series at the points on the boundary of the circle of the 
radius of convergence. This section is meant as a reference and should be skipped in a first 


class on power series. 


Theorem 9.6.1. Let °°. axx* and S7P-9 bxx® be convergent power series with radii 
of convergence R, and Rp, respectively. Let R = min{R,,R2}. Then on B(0, R) 
the Drone, sequence {()-p_9 @nx") + (Sop bev*)}n converges to the power series 
pp Foot 0 455k je". 

We write this as (d~P--p anx") - (Sop Oe”) = nO 9 ajb%-;)2* on B(O, R). 


Proof. If R = 0, there is nothing to prove. So we may assume that R is a positive real 
number or it is 00. 

Fix z € B(O,R). Set LD = Deo Meh and K = )--9bgx*. These limits exist 
because |2| < Ri, Rg. Also set sy = doy aun = oe F b,x", ee = ea apby 5 
By the theorem on the convergence of products a sequences (Theorem 8.4.3), lim(s,tn) = 
LK. Thus we need to prove that eas cpa*® = LK. 

Let « > 0. By convergence of products, there exists Ni, > 0 such that for all integers 
n> Ni, |Sntn — LK| < €/2. By expanding (repeated “foiling” ) and by grouping, 


— (> as) 2 b jx) 
k=0 j=0 
n k n n 
= > ys Ge Dip 3 > ptncuage 


k=0 m=0 k=1m=k 


k 
ee). y Ce en a: ae 


k=0 k=1m=k 
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Since |z| < R < Rj, Ro, the series S77. |axx*| and S7P°_, |b,x*| converge, to some real 
numbers L, K, respectively. Thus there exists No > 0 such that for all integers n > No, 


CO CO n € 
S> laxa®| = |S° lanx*| -— So laxa*|| < =—, 
k=n+1 k=0 k=0 Ak +1 


€ 
AL+1° 


and similarly there exists N3 > 0 such that for all integers n > N3, Ae lb,a*| < 
Now let n > max{2Nj1, No, N3}. Then 


n n 
Ss S- | Cae | 


k=1 m=k 
n [n/2] 


= am in n+k—mv 7 
ys SS | xb n+k | 


k=1 m=k 


n n 
+ y y lene Onsen 


k=1 m=max{k,|n/2|+1} 


[n/2] n 
< SS lane” a |b. 2" | + 
m=1 k=|n/2| 
n [n/2| 
ys lain | ye |b" | , 
m=|n/2\| k=1 


(this product expands to more non-negative terms) 
€ € = 


ee a 
4h4+1 4k41 
€ 
<< 5° 
Thus 
at 28 = So ut — tly uty = EA 
k=0 k=0 


IA 


+ |Sntp, — LK| 


n 
os Chak — Sntn 
k=0 
n n 
< s » Ce a AE cn 


k=1 m=k 
e/2+6/2 


+€/2 


A 


= €. 
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Theorem 9.6.2. Let f(x) = 07.9 cex* have radius of convergence a positive real num- 
ber R. Let a € C with |a| = R such that )\7-.)crha* converges. Let B be an open ball 
centered at a, and let g : B — C be continuous. If f(x) = g(x) for all x € B(O,R)NB, 
then f(a) = g(a). 


Proof. Let € > 0. We want to show that | f(a) — g(a)| < €, which via Theorem 2.10.4 then 
proves that f(a) = g(a). It suffices to prove the inequality |f(a) — g(a)| < € under the 
additional assumption that € < 1. 


Since 77-9 c,a® converges, by Cauchy’s criterion Theorem 9.2.1, there exists a posi- 
tive integer N such that for all integers n > N, | 7p_y cna*| < €/4. Let sm = eG c,a*. 
By assumption, for all m > 1, |sm|< ¢/4 <1. Furthermore, cy = so/a%, and for n > N, 
CNtn = (Sn — Siena. es 

Let r be a real number in the interval (0,1). By rewriting and by the triangle 
inequality then 
N+n 


+3 Ck (ra)* 


k=N 


N N N+1,.N+1 N N 
= lt Gna Sena re Tee ewe or ba 


= |r 59 + (81 — so)r Nt! + (82 — si) Nt? +++ + (82 — Snir | 
= |so(r* ape) segs (phe) ag NEA) ace 
+5n—1(r NTP) — pNP) + sy tm! 
=r |(L—r) (so + sir +--+ + 8,-19r""') + Snr” 
<r% ((1—r) ([s0| + |silr +--+ + |Sn—1|r"—") + |snir”) 
S art (A-r) (apres eee ie®) 
(since |s,,| < €/4 for all these m) 


= ra (l1—r"+r”) (by Example 1.6.4) 


Since polynomial functions are continuous, there exists 6; > 0 such that for all 
x € B(a, 61), ee chra* — ye cna < 6/4. 


Since g is continuous at a, there exists 62 > 0 such that for all  € BN B(a, 62), 
|g(z) — g(a)| < €/4. Let r € (0,1) such that 1—r < 4 min{6,, 52}. Then |a — ra| < 44, 62, 
so that 


Ig(@) — F(a)| 
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g(a)—g(ra) +g(ra)—f(ra)+f (ra) > Cha 4 cya* — f (a) 


N-1 N-1 
< |g(a)—g(ra)|+|g(ra)— f(ra)|+|f(ra)— $5 cpa*| +] S— cxa*— f(a) 
k=0 k=0 
love) N-1 N-1 fore) 
= |9(a)—g(ra)|+0+] S— cx(ra)*+ S© ex(ra)’—S° cpa*) +] S— ae! 
k=N k=0 k=0 k=N 
s love) N-1 N-1 love) 
ao ae cz (ra)*| + c,(ra)* — cpar| + > cya" 
k=N k=0 k=0 k=N 
Jee ae 
4 4 4 
=e 


Since ¢€ is arbitrary, by Theorem 2.10.4, f(a) = g(a). 


Exercises for Section 9.6 


9.6.1. Expand ()7-., 2”)? as a power series, i.e., with proof determine all the coefficients 
of the series. 


9.6.2. Let 77°, GRE. pees b,x" be power series and R a positive real number such that 
for all x € B(O, R), (WR anx*) « (ORG bew*) = 1. In other words, at each x in B(0, R), 
the two infinite series are multiplicative inverses of each other. 
i) Prove that for all x € B(0, R), 7.9 anx*® #0. 
ii) Prove that ao # 0. 
iii) Prove that bo = 4, and that for all k > 0, bp = —4 S0j_, ajby—j. (Hint: Theo- 
rem 9.6.1.) 
iv) Suppose that there exists M € R such that for all poe eee integers n, 
lanz"| < M. Prove that for all integers k > 1, |bya*| < (Mtlsolye—-1 (Hint: 


|a0| 
induction. ) 


lao 


9.6.3. (Abel’s scape cea! that ao, a1, @2,... are complex numbers and that )°, ax 
converges. Let f(x) = >>, a,x". The goal of this ¢ exercise is to prove that f is defined on 
B(O,1) U {1}, that it is continuous on B(0,1), and that when the domain is restricted to 
(B(0,1) U{1}) NR, then f is continuous also at 1. 

i) Prove that the domain of f includes B(0,1) U {1}. 

ii) Prove that f is continuous on B(0,1). (Hint: Invoke a theorem.) 


t “Lemma” means a “helpful theorem”, possibly not interesting in its own right, but useful later. There 
are examples of so-called lemmas that have turned out to be very interesting in their own right. 
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Set LD = )°, ax, and for n > 0, set s, = a9 +a, +--- +a, —L. 
iii) Prove that )7;'_ sx(L—ax)a* = s,(1—2"4!) +07, ax (x* — 1). (Hint: Summation 
by parts, see Exercise 9.2.11.) 
iv) Prove that for every « > 0 there exists N € N such that for all integers n > N, 
|Sn| <€. 
v) Prove that the sequence {}7/_, sx(1 — x)x*}, converges for all x € B(0,1) U {1}. 
vi) Prove that f(x) — L = 77.9 s¢(1 —2)a* for all x € B(0,1) U {1}. 
vii) Let « > 0. Prove that there exists N € R such that for all integers m > N and for 
Yea sed —2)2*| <4 
viii) Let € > 0. Prove that for every positive integer n there exists 6 > 0 such that for 
all complex numbers « € (B(0,1) U {1}) N B(1,4), |S°¢_5 s4(1 — a)2"| see. 
ix) Let « > 0. Prove that there exists 6 > 0 such that for every real number x in the 
interval [max{0, 1 — 6}, 1], |S°729 se(1 — oa" | = €: 
x) Let g: [0,1] — C be defined by g(x) = f(x). Prove that g is continuous. 
*9,6,4. Let f(z) = 32/2, axa” and g(z) = >> 4 bew*. 
i) Express (f 0 g)(x) as a power series in terms of the a; and 6;. 


all real numbers z in the interval [0, 1], 


ii) What is special for the power series of the composition if aj = 0? 
iii) Assuming that a9 = 0, and given the radii of convergence for f and g, what would 
it take to find the radius of convergence of the composition series? 


9.7 ‘Taylor series 


A common way of generating power series comes from approximations of functions. 


Definition 9.7.1. Leta be in the domain of a function f, and assume that f has derivatives 


of all orders at a. The Taylor series of f (centered) at a is the series )>;~ 5 p@ (x—a)*. 


Remark 9.7.2. If a = 0, the Taylor series is a power series (as defined in this chapter), 
and for other a this is also a power series but of a more general kind which can nevertheless 
be easily transformed into a usual power series in the following sense. Let f : A — C. Set 
B={x-a:2€A}h,andg: B-> Cas g(x) = f(x+a). By straightforward calculus, 
>>, ax(2 — a)" is a Taylor series for f at a if and only if 3°, a,2" is a Taylor series for g 
at 0. Furthermore, the radius of convergence of the Taylor series for g is R if and only if for 
all x € C, the Taylor series }>, ax(a —a)* for f at a converges at x whenever |x —a| < R 
and diverges at « whenever |x — a| > R. 

Thus any analysis of Taylor series can by a function domain shift be transformed 
into a Taylor series that is a power series. 
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Section 6.5 contains many examples of Taylor polynomials and a few more are com- 
puted in this section. For some functions this computation is easier than for others. The 
following theorem covers a trivial computation, and it applies in particular to all polynomial 


functions. 


Theorem 9.7.3. Let f(x) = >°?2.9 axx* be a power series whose radius convergence is 
not zero. Then the nth Taylor polynomial of f centered at 0 is 


n 
) ape, 
k=0 


and the Taylor series of f centered at 0 is 
CO 
SS apa* . 
k=0 


Proof. By Exercise 9.4.3, f'@) equals S77" ax ame” Thus pO) equals am. 


m! 


This proves the form of the Taylor series. 


Remark 9.7.4. Theorems 6.5.5 and 6.5.6 say that some Taylor series are convergent power 
series at each x near 0 and that they converge to the value of the original function. This 
is certainly true for functions given as power series (such as in the theorem above). We 
examine a few further examples in this section with a few left for the exercises. Beware 
that the Taylor series need not converge to its function at any point other than at a, see 
Exercise 9.7.3. 


Example 9.7.5. Let f(x) = e”, where the domain of f is R. It is easy to compute the 


Taylor series for f: 
k 


= x 


By the ratio test or the root test, this series converges for all « € C, not only for x € R. 


(More on this series is in Sections 10.1 and 10.2, where we learn more about exponentiation 


by complex numbers.) Let « € R. By Theorem 6.5.5, for every positive integer n there 
exists d, between 0 and x such that 


nok d 
x v e” n+1 
e = 
cee 
Thus 
Lee dn |x| 
x zr = € n+1 € n+l 9 7 6 
" di (n+1)! (n+1)! 28) 
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|a| 


The sequence 4 ~-~—~2"*! + converges to 0 by the ratio test for sequences. This proves 
(n+1)! 


that for each x € R, the Taylor polynomials for e” approximate e* arbitrarily closely, and 


that the Taylor series at each x equals e”: 


In practice, this is precisely how one (a human or a computer) computes values e” to 
arbitrary precision. For example, to compute e°-! to within 0.0001, we first need to find n 
such that the remainder as in Equation (9.7.6) is at most 0.0001. We first simplify an 
upper bound on the remainder to make the finding of n easier: 

el0-1 
(n+1)!- ~ (n+1)! 
We can check that the smallest integer n for which the last expression is at most 0.0001 


is n = 3, so that to within 0.0001, e°! is Ss alt = 6631/6000 © 1.10516666666667. 


n+1 0 yeti 


To compute e~°!, the same n works for the desired precision, giving e~°! to be 
k 

within 0.0001 equal to SZ) 22 = 422 & 0.904833333333333. Incidentally, 

16 0.1% _  21021144462931669584965107651 ~w 16> SSO 

k=-0 gr = T9020718080000000000000000000 = 1-10517091807565 and )),-9 a - 


7011749326061779176954720883 ~w 
7749181440000000000000000000 ~~ 0.90483741803596. 


To compute e” to within 0.0001, then with some simplication we need to find an 


integer n such that 
Tv 


34 


sa re OOO, 
(n+ 1)! * 


(n+ 1) S 


€ sans 


With elementary (but tedious) arithmetic we can compute that the inequality holds for 
n = 19 and bigger but not for n = 18. Thus we can compute e” to 0.0001 of precision as 
hog Sr = 23.1406926285462 + 0.0001. 


Example 9.7.7. Let f(x) = In(a +1), where the domain of f is (—1,00). It is straight- 
forward to compute the Taylor series for f centered at 0: 


(=) tak SA (=a) 
> k aan DS k 


k=1 k=1 


By the Ratio test for series, Theorem 9.2.3, the radius of convergence for this series is 1. 
It is worth noting that the domain of the function f is all real numbers strictly bigger 
than —1, whereas the computed Taylor series converges at all complex numbers in B(0, 1) 
and diverges at all complex (including real) numbers whose absolute value is strictly bigger 
than 1. By Example 9.1.7, the series diverges at x = —1, and by Theorem 9.2.5, it converges 
at x = 1. Furthermore, by Exercise 9.2.14, the series converges at all complex numbers x 
with |z| < 1 and « #1. You should test and invoke Theorem 6.5.5 to show that for all 
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€ Goel 1); In(x + 1) 5 et i 
Incidentally, since In(a + 1) is continuous on its domain and since its Taylor series 
converges at x = 1 by the Alternating series test (Theorem 9.2.5), it follows from Theo- 


rem 9.6.2 that 
—1) k-1 


CO 
9 ae = In 2, 
k=1 
With the help of Taylor series we can similarly get finite-term expressions for other 


infinite sums. Below is a harder example (and the reader may wish to skip it). 


Example 9.7.8. Let f(x) = /l—«z. The domain of f is the interval (—oo, 1]. On the 
sub-interval (—oo, 1) the function has derivatives of all orders: 


f(a) =-5(1- 2)”, 


LL 
" yes = —3/2 
m ae —5/2 
f ( ) 9 9 5 (1 z) ca aeiane 
1 1 3 2n—3 
Gig 2 aa ee ee. Lp) 2n-D)/2 


1 ~.1-3-5---(2k—3) , 
Bega Do Rg -_ 


k=2 
For anes n, the quotient of the (n + 1)st coefficient divided by the nth coefficient equals 


2n—1_ | whose limsup equals 1. Thus by the Ratio test for power series (Theorem 9.3.9), 


CEE ’ 
the Taylor series converges absolutely on oe 1), and in particular it converges absolutely 
n (—1,1). Furthermore, the quotient a = = aera above is at most 1 — 43 for all 


n > 4. Thus by Raabe’s test (Exercise 9.2.17), this Taylor series converges at x = 1, and 
so it converges absolutely on [—1, 1]. But what does it converge to? Consider x € (—1, 1). 
We use the integral form of the Taylor’s remainder theorem (Exercise 7.4.9): 


T = = SER eke 1 n dt 


-[(() Sea 
eo fe, oak di nlart1/T—t 


As the integrand goes to 0 with n, and as |x| < 1, the integral goes to 0 with n, so that 


the Taylor series converges to f on (—1,1). (Incidentally, an application of Exercise 9.6.3 
shows that the Taylor series is continuous on [—1,1], and as f is also continuous there, 
necessarily the Taylor series converges to f on [—1, 1].) 
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Exercises for Section 9.7 


9.7.1. (This exercise has overlap with Exercise 6.5.7.) Let f(z) = <4. 
i) Compute f(x) for all integers n > 0. 


Compute and justify the Taylor series of f centered at 0. 


) 
) 
iii) Determine the radius of convergence. 
iv) What is the domain of f and what is the domain of its Taylor series? 
) Compute the Taylor series for f centered at 2. Determine its radius of convergence. 
i) 


vi) Compute the Taylor series for f centered at 3. Determine its radius of convergence. 
Compare with the previous parts. 


9.7.2. Let f(x) = In(x +1), where the domain of f is (—1,co) as in Example 9.7.7. 
Compute f(0.001) from one of its Taylor polynomials to four digits of precision. Show all 
work, and in particular invoke Theorem 6.5.5. 

9.7.3. Let f : R—-R be given by 


= en t/2? it eae 0; 
fee)= {6 if2=0 


as in Exercise 7.6.16. 
i) Compute and justify the Taylor series for f centered at 0. 
ii) Compute the radius of convergence for the series. 
iii) Discuss whether it possible to compute f(0.001) to arbitrary precision from this 


series. 


*9.7.4. In introductory analysis courses we typically handle infinite sums but not infinite 
products or even longer and longer products. In Example 9.7.8 we proved that the Taylor 
series for f(x) = V1 — «x centered at 0 converges on [—1,1]. This in particular means that 


1-3-5-+-(2n—3) 


nine |, converges to 0. Can you prove this directly without involving 


the sequence { 
Raabe’s test? 


Chapter 10: Exponential and trigonometric functions 


The culmination of the chapter and of the course are the exponential and trigonomet- 
ric functions with their properties. Section 10.4 covers more varied examples of L’H6pital’s 
rule than what was possible in Section 6.4. 


10.1 The exponential function 


Define the power series 
CO Lk 
x 
k=0 


By Examples 9.3.10 (3), the domain of EF is C. Thus by Section 9.4, F is differentiable 
everywhere. 

We prove below that this function is the exponential function: the base is e and the 
exponent is allowed to be any complex number x not just any real number. 


Remarks 10.1.1. 
(oe) gk-t (oe) x CO x 
(1) B'(2) = ea “a = Gat = Lk=o a = Ele). 
(2) E(0O) =1. 
(3) Let a € C. Define g: C > C by g(x) = E(x +a): E(—x). Then g is a product of 
two differentiable functions, hence differentiable, and 


g(a) = E'(a@+a)- E(—2) + E(x +a) - E’(—2) - (-1) 
= E(x +a)-E(—x) — E(a+a)- E(-2) 
= 0, 
so that g is a constant function. This constant has to equal 
g(0) = E(0+a)- E(—0) = E(a). 
Thus for all a and z in C, g(x) = g(0), ie., that E(a + a)E(—x) = E(a). 
(4) Thus for all c,d € C, E(c +d) = E(c)E(d) (set c=x+a,d=-—2). 
(5) By induction on n and by the previous part, it follows that for all positive integers 
n and alla €C, E(na) = (E(a))”. 
(6) Part (3) applied to a = 0 gives that 1 = E(0) = E(x)E(—a). We conclude that 


E(a) is never 0, and that E(—x) = TE 
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(7) By parts (5) and (6), E(na) = (E(a))” for all integers n and alla € C. 

(8) Let a,b € R. By part (4), E(a+ bi) = E(a)- E(bi). Thus to understand the 
function E : C > C, it suffices to understand F restricted to real numbers and E 
restricted to 2 times real numbers. We accomplish the former in this section and 
the latter in the next section. 


Theorem 10.1.2. For all € R, E(x) = In '(x) = e*, the exponential function from Sec- 
tion 7.6. In particular, E(1) equals the Euler’s constant e. 


Proof. Define f : RR by f(z) = =. Then f is differentiable and 


ex 


_ E(eje* — E(ayle*y’ _ E(w)e* — E(zje* _ 
(er)? (ex) , 


Thus f is a constant function, so that for all z € R, 2 te). 7 (0) EO) p= ly 
Thus E(x) = e?. By Theorem 7.6.7, this is the same as In~'(a). In particular, E(1) = 


e Se. 


f'(2) 


The following is now immediate: 


Theorem 10.1.3. The function E restricted to R has the following properties: 
(1) The range is all of R*. 
(2) E restricted to R is invertible with E~‘(x) = In(z). 


The sequence {}>;_) @}n of partial sums for E(1) = e! = e is convergent, and 


starts with 1,2,2.5,8 ~ 2.66667, $9 ~ 2.70833, 48 = 2.71667, 33% =~ 2.71806, $8 = 


2.71825, OO = 2.71828, 382" = 2.71828. The last approximation is from the Taylor 


polynomial of degree 9 and it is correct to 5 decimal places because by the Taylor remain- 
der theorem (Theorem 6.5.5) there exists d € (0,1) such that 
_ | £O%(d) eo 3 


yap eee @ 1) = |= — 0)? | So 2 = S283 x07" 
IE(L) — To,2,0(1)| moo io ig. 


Exercises for Section 10.1 


10.1.1. Use the definition of In from Theorem 10.1.3 to prove that for all x,y € RX 
In(ay) = Inz + Iny, and that for all integers n, In(x”) = ning. 


’ 


10.1.2. Use the definition of In from Theorem 10.1.3 and derivatives of inverse functions 
(Theorem 6.2.7) to prove that (In)/(x) = 4. 
10.1.3. Prove that any function from R to R that equals its own derivative equals 


cE(restricted to R) for some c € R. 
Similarly prove that {f :C— C differentiable | f’ = f} = {cE : ce C}. 
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10.1.4. Find ao, a,,a2,... € R such that the power series ease a,x* converges for all 
reERtoe®. (Hint: Use the function E to do this easily.) With that, determine a power 
series whose derivative is e(”), (This problem is related to Exercise 9.4.2; it turns out that 


there is no simpler, finite-term antiderivative of e® ) 


10.1.5. Determine a power series whose derivative is g(2*), (It turns out that there is no 
simpler, finite-term antiderivative of this function — see the no-closed-form discussion on 
page 259). 

10.1.6. Express }>;" 5 oe in terms of e. 


10.1.7. Express 77° 5 in terms of e. 


gk 
3*(k+1)! 
10.1.8. Compute a Taylor polynomial of degree n of the function FE’ centered at 0. (Do as 
little work as possible, but do explain your reasoning.) 


10.1.9. Numerically evaluate e? from the power series for e* = E(x) to 5 significant digits. 
Prove that you have achieved desired precision. (Hint: Theorem 6.5.5.) 


10.1.10. Compute lim,_,, E(x). (Do as little hard work as possible, but do explain your 


reasoning. ) 


10.1.11. With proof, for which real numbers a does the sequence { E(na)}, converge? 


10.2. The exponential function, continued 


In this section we restrict F from the previous section to the imaginary axis. Thus 
we look at E(ix) where x varies over real numbers. Note that 


k=0 
= a ie * (ix)? n (ix)? (ix)* (ix)® (ix)® (ix)’ (éx)8 ai 
1! 2! 3! 4) 5! 6! 7! 8! 
2 a. id 5 6 7 igs 


=P of Sr al ee. ok Cae @ 


We define two new functions (their names may be purely coincidental, but pronounce them 


uA 


for the time being as “cause” and “sin”): 


COS (#) = Re E(ix) =1 + Rp eee ae 


ea ae ie 
SIN (x) = Im E(iz) = Tae ah ie + a pices 


Thus 
E(ixz) = COS (a) + iSIN (2). 
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Since E(ix) converges for all x, so do its real and imaginary parts. This means that COS 
and SIN are defined for all x (even for all complex x; but in this section all x are real). 
Thus the radius of convergence for the power series COS and SIN is infinity. 

We can alternatively practice the root test on these functions to determine their radii 
of convergence: the a for the function COS is lim sup {1, 0, ‘/| xi|>0, i/| i <0; ¢/| ai 
Since 0s do not contribute to limsup of a sequence of non-negative numbers, it follows that 
a = limsup {1, ‘/| 4 : i/| 4 ; ‘/| A a2}: By Example 8.3.6 this is 0. Thus the radius 
of convergence of COS is oo. The proof for the infinite radius of convergence of SIN is 


pee . 


similar. 


Remarks 10.2.1. 
(1) E(ix) = COS (x) +7SIN (x). This writing is not a random rewriting of the sum- 
mands in E(ix), but it is the sum of the real and the imaginary parts. 
(2) COS (0) = Re F(z - 0) = Rel = 1, SIN (0) = Im F(¢-0) =Im1=0. 


(3) By the powers appearing in the power series for the two functions, for all x € R, 


COS (—x) = COS (a), SIN (—a) = —SIN(z). 
Thus E(—ix) = COS (a) — 7 SIN (a), which is the complex conjugate of E(ix). 
(4) For all z ER, 


x) +iSIN (a)) - (COS (a) — i SIN (2)) 
x) +7SIN (a)) - (COS (—a) +7 SIN (—2)) 


Thus | (ix)| = 1 for all x €] 
(5) We conclude that for all x € R, the real and imaginary parts of E'(ix) have absolute 


value at most 1. In other words, 
—1< COS (a), SIN (2) < 1. 
(6) Since E is differentiable, 
(E(ix)) = E'(ix)i = Elix)i 
= (COS (xz) +7SIN (x))i = 7COS (2) — SIN (2). 
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It follows by Theorem 6.2.6 that 
(COS («))' = (Re(E(iz)))' = Re((E(iz))’) = —SIN (2), 


and 
(SIN (x))’ = (Im(E(ixz)))’ = Im((E(éz))’) = COS (x). 


Theorem 10.2.2. There exists a unique real number s € (0,3) such that E(is) = i. 
In other words, there exists a unique real number s € (0,./3) such that COS(s) = 0 and 
SIN (s) = 1. 


Proof. The function t — SIN (t) is differentiable and its derivative is 1 — COS (t), which is 
always non-negative. Thus t — SIN (t) is non-decreasing for all real t, so that for all t > 0, 
t—SIN (t) > O—SIN (0) = 0. Hence the function a +COS (t) has a non-negative derivative 
on [0, 00), so that + COS (#) is non-decreasing on [0,0o). Thus for all t > 0, e + COS (t) 
= us + COS (0) = 1. It follows that the function us — t+ SIN (t) is non-decreasing on 
[0, co). [HOW LONG WILL WE KEEP GOING LIKE THIS?] Thus for all t > 0, e —t+SIN (¢) 


= on —0+SIN (0) =0. Thus = - — COS (t) is non-decreasing on [0, 00), so that for all 


t> 0, a — ee — COS (t) > oe - te — COS (0) = —1. We conclude that for all t > 0, 
ue 
COS (t) < 7 i gti 


In particular, COS (V3) < ek — va +1 = —% < 0. We also know that COS(0) = 
1 > 0. Since COS is differentiable, it is continuous, so by the Intermediate value theorem 
(Theorem 5.3.1) there exists s € (0, /3) such that COS (s) = 0. 

Now suppose that there exists a different u € (0,3) such that COS(u) = 0. By 
possibly switching the names of s and wu we may assume that s < u. Note that since s and 
u are both positive, s? < u?. Since . _ S — COS (#) is non-decreasing on [0, 00), it follows 
that a 8 = COS (u) > = - ce — COS (s). Since COS (wu) = COS (s) = 0, this says that 


2 
ut U s 
24 2 2 


q — %- In other words, 


1 1 2 2 1 1 u2 — s2 tae ses 
Bs es Fegan yt et) eee PY = = 
4 rie ve pg PE) gg OO) ene on ge OF 


which is a contradiction. Thus s is unique. 

Since (COS (s))? + (SIN (s))? = 1, we have that SIN (s) = +1. Since COS is positive 
on (0, V3), this means that SIN is increasing on (0,/3), and so by continuity of SIN, 
SIN (s) must be positive, and hence SIN (s) = 1. This proves that E(is) = 1. 
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Remark 10.2.3. The proof above establishes the following properties for all t > 0: 


COS(t)<1, COS()>1- 5, COS@)<1-F+5, 


t 
SIN (t)<t, SIN(@) 2t- =. 
Observe that the polynomials above alternately over- and under-estimating COS are the 
Taylor polynomials of COS (by Theorem 9.7.3) and are converging to the Taylor series 
for COS. Similarly, the polynomials above alternately over- and under-estimating SIN are 
the Taylor polynomials of SIN converging to the Taylor series for SIN. 


Remark 10.2.4. (About the numerical value of s.) What role does /3 play in Theo- 
rem 10.2.2? We deduced that s < /3 from knowing that COS (t) < 1 — ¢?/2+ ¢*/4! at all 
t > 0, from knowing that the latter polynomial evaluates to a negative number at 3, and 
from an application of the Intermediate value theorem. You probably suspect that s = 7/2. 
One can check that 7/2 < V3, and even that 7/2 < 0.91. V3. However, 1 — t?/2 + 4/4! 
at 0.91 - /3 is positive, so we cannot conclude that s < 0.91- /3 from this argument. 
But if in the proof of Theorem 10.2.2 we compute a few more upper and lower bounding 
polynomials of SIN and COS, we obtain that COS (t) < 1 — t?/2 + t*/4! — t°/6! + t8/8! 
for all t > 0, and this latter polynomial is negative at 0.91 - 3, which would then by the 
Intermediate value theorem guarantee that s < 0.91-/3. By taking higher and higher 
degree polynomials as in the proof we could get tighter and tighter upper bounds on s. 
We can also get lower bounds on s. From COS (t) > 1 — t?/2 (as in the proof) we deduce 
that COS (t) is positive at 0.999999 - \/2 (with an arbitrary finite number of digits 9). Thus 
the unique s must be in [v/2, 0.91 . V3). More steps in the proof also show that From 
COS (t) > 1 —#t?/24 t4/4! — t®/6! for all t > 0, and this polynomial is negative at 0.9- V3. 
Thus by the Intermediate value theorem, 


0.9:-V3<s <0.91- V3. 


Incidentally, my computer gives me 1.55884572681199 for 0.9 - 3 and 1.57616623488768 
for 0.91-/3. Thus we know the in-between s to a few digits of precision. (Incidentally, my 
computer gives me 1.5707963267949 for 7/2.) By taking higher-degree polynomials in the 
more-step version of the proof of Theorem 10.2.2 we can get even more digits of precision 
of s. However, this numerical approach does not prove that s is equal to 7/2; we need 
different reasoning to prove that s = 1/2 (see Theorem 10.2.5). 


Finally, we connect COS and SIN to trigonometric functions. First we need to 
specify the trigonometric functions: For any real number f¢, 


cos(t) + isin(t) 
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is the unique complex number on the unit circle centered at the origin that is on the half- 
ray from the origin at angle t radians measured counterclockwise from the positive x-axis. 
In terms of ratio geometry (see page 20), this says that in a right triangle with one angle 
t radians, cos(t) is the ratio of the length of the adjacent edge divided by the length of 
the hypotenuse, and sin(t) is the ratio of the length of the opposite edge divided by the 
length of the hypotenuse. Geometrically it is clear, and we do assume this fact, that cos 
and sin are continuous functions. In general continuity does not imply differentiability, but 
for these two functions we use their continuity in the proof of their differentiability. 


Theorem 10.2.5. The trigonometric functions cos and sin are differentiable. Furthermore, 
COS and SIN are the functions cos and sin, s = 7/2, and for any real number x, E(ix) 
is the point on the unit circle centered at 0 at angle x radians measured counterclockwise 
from the positive real axis. 


Proof. We know that E'(izx) is a point on the unit circle with coordinates (COS (a), SIN ()). 
What we do not yet know is whether the angle of this point counterclockwise from the 
positive real axis equals x radians. 

Let s be as in Theorem 10.2.2. The angle of E(is) = 0+i%-1 = 7 measured in radians 
counterclockwise from the positive real axis is 7/2. Uniqueness of s guarantees that COS 
is positive on |0,s). Thus SIN is increasing on [0, s], and it increases from 0 to 1. Thus for 
all t € (0, s), E(it) is in the first quadrant. 

Let n be an integer strictly greater than 1. Let a be the angle of E(is/n) measured 
counterclockwise from the positive real axis. For every integer j, (E(ijs/n)) = E(is)4, so by 
Theorem 3.12.2, the angle of (E(ijs/n)) is ja, and in particular the angle na coincides with 
the angle of E(is) = E(ins/n), ie., with 5 + 27k for all integers k. Thus a = (5 +2rk)/n 
for some integer k. For all 7 = 1,...,n —1, j/n € (0,1), and so E(ijs/n) is in the first 
quadrant by the previous paragraph. Thus 


0< ja = j>-(1 + 4k) < -- 


The first inequality says that k is not negative, and the second inequality used with 7 = n—1 


says that k is not positive. Thus k = 0 anda = + 


2n° 
Together with Theorem 3.12.2 we just established that for all positive integers n and 
all integers j, the angle of E(isj/n) = (E(is/n))9 is ja = - 4. In other words, for all 
rational numbers r, 


2 
By continuity of the functions EF’, cos and sin we conclude that for any real number 2, 


COS (sr) + 7SIN (sr) = E(isr) = cos (Fr) +isin (Sr) : 


COS (sz) + 7SIN (sx) = E(isx) = cos & +isin (Fz) : 
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By matching the real and the imaginary parts in this equation we get that for all x € R, 


Satay = COS (2c), snap SiN (2c), 


T 
and so cos and sin are also differentiable functions. By the chain rule, 


2 2 2 
eos (a= _“*sIN (22) a a sin(2), 
7 T 7 
2 2 2 
sin’ (2)'= ** cos (=) a cos(x). 
7 T 7 


x 1 
sin x cos x* 


By geometry (see Exercise 1.1.19), for small positive real x, 0 < cos(x) < 
Since cos is differentiable, it is continuous. Since cos(0) = COS (0) = 1, by the Squeeze 


theorem (Theorem 4.4.11), lim,_,9+ =*= = 1. Hence 
2 2 inx — sinO i 
= ** cos(0) = sin’(0) = lim ee =n ee = L 
T x0 x a30+ 


whence s = 7/2. This proves that cos = COS and sin = SIN. 


Theorem 10.2.6. Every complex number x can be written in the form rE(i@), where 
r = |x| is the length and 0 the angle of x counterclockwise from the positive real axis. 


Proof. Let r = |x|. Thus z lies on the circle centered at 0 and of radius r. If r = 0, then 
x = 0, and the angle is irrelevant. If instead r is non-zero, it is necessarily positive, and 
x/r is a complex number of length |a|/r = 1. By Theorem 3.12.1, x/r and x have the same 
angle. Let 6 be that angle. Then x/r = E(i@), so that « = rE(i6). 


Notation 10.2.7. It is common to write E(x) = e® for any complex number x. We have 
seen that equality does hold if x is real, but we adopt this notation also for other numbers. 
With this, if x,y € R, then 


erty = ete, 


and e” is the length and y is the angle of e”+’Y counterclockwise from the positive x axis. 


Exercises for Section 10.2 

10.2.1. Let a@ and r be real numbers with r > 0. Using the findings from this section 
prove that for any complex number x, the complex number re’ - x is obtained from x by 
rotating x counterclockwise around the origin by angle a radians and by stretching the 
result by a factor of r. (Compare with Theorem 3.12.1.) 

10.2.2. Prove that e’” + 1 = 0. (This has been called one of the most beautiful equalities, 
because it connects the fundamental numbers 0, 1, e and 7.) 
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10.2.3. Let x,y € R. Expand both sides of E(i(a + y)) = E(ix)E(iy) in terms of sin and 
cos to prove that cos(x + y) = cos(x) cos(y) —sin() sin(y), and sin(a# + y) = sin(x) cos(y) + 
cos(x) sin(y). (Cf. Exercise 1.1.16.) 


10.2.4. Prove de Moivre’s formula: for all n € Z and all real numbers z, 
(cos(a) + isin(x))” = cos(nx) + isin(nz). 


10.2.5. Prove that for every real number x that is not an integer multiple of 27, the series 


CO (oe) 


cos( ka) sin(ka) 
»B eS 1S ke 
k=1 k=1 


converge. (Hint: The complex series }>7°, 


cos(ke) ¥4 sin(kz) converges if and only if its 


real and imaginary parts converge. Use De Moivre’s formula (Exercise 10.2.4) and Exer- 
cise 9.2.12.) 


10.2.6. Express the following complex numbers in the form e*+’Y: i, —i, 1, —1, e, 2+ 2%. 
Can 0 be expressed in this way? Justify. 


10.2.7. Express (3 + 4i)° in the form e*+” for some a,b € R. (Do not give numeric 
approximations of a and 6.) 
10.2.8. Prove that for any integer m, the sequence {E(2aimn)},, converges. Prove that 
the sequence {£(./27in)}, does not converge. Determine all real numbers a for which the 
sequence {F(ina)},, converges. Prove your conclusion. 
10.2.9. For which real numbers a and (6 does the sequence {E(n(a + iZ))}, converge? 
Prove your conclusion. 
10.2.10. Let f(x) = sin(x?) or f(x) = cos(x?). 
i) Determine the Taylor series for f centered at 0. (This should not be hard; do not 
compute derivatives of all orders.) 
ii) Determine a power series whose derivative is f. (There is no simpler or finite-term 
antiderivative of f.) 
iii) Determine the radius of convergence of the antiderivative power series. 
iv) Use the Taylor remainder theorem (Theorem 6.5.5) to compute Io f to within five 
digits of precision. 
10.2.11. (This is from the Reviews section edited by P. J. Campbell, page 159 in Mathe- 
matics Magazine 91 (2018).) Let P be a polynomial function and m a non-zero constant. 


i) Prove that P + P + Pe + P* +--- is a polynomial function. 
ii) Compute the derivative with respect to x of 


(ry 2220 A.) 


Section 10.3: Trigonometry 357 


iii) Integrate ecm P(e)ae. 


*10.2.12. The goal of this exercise is to give another proof that s = 7/2. Let f : [0,1] ~ R 
be given by f(x) = /1 —2?. Then the graph of f is the part of the circle of radius 1 that 
is in the first quadrant. 

i) Use lengths of curves and that 27 is the perimeter of the circle of radius 1 to justify 


that 
1 2 
T —22 
a 1+ | ————.]_ dz. 
2 [ / (= 1- =) 


(Caution: This is an improper integral.) 
ii) Compute the integral via substitution x = COS (u). The integral should evaluate 
to s from Theorem 10.2.2. 


10.3. Trigonometry 


In this section we review the semester’s concepts of limits, continuity, differentiability, 
and integrability on the newly established functions from the previous section. 


Definition 10.3.1. (Trigonometric functions) For any real number z, e’* is the complex 
number that is on the unit circle at angle x radians counterclockwise from the positive 
horizontal axis. 

(1) sin(x) is the imaginary part of e**. 

(2) cos(x) is the real part of e’”. 


(3) tan(z) = Or cot(z) = ae), sec(r) = ETCH esc(x) = 


i 
sin(x) ° 


Remarks 10.3.2. The following is straightforward from the work in the previous section: 
(1) sin and cos are differentiable and hence continuous functions whose domain is R. 
(2) The Taylor series for sin is 


ees 
2") ORF I! 


and it converges to sin(x) for all « € R. Similarly, the Taylor series for cos is 


0° ok 
cos(a) = dela Cais 


(3) All trigonometric functions (as in the definition above) are continuous and differ- 
entiable on their domains. By the quotient rule for differentiation, 
tan’ (x) = (sec(zx))?, cot’(x) = —(cse(2x))?, 
sec’(x) = sec(x)tan(x), csc’(a) = — csc(x) cot(zx). 
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(4) For all « € R, sin(x + 27) = sin(x) and cos(a% + 277) = cos(x). This is not obvious 
from the power series definition of E(ix), but if follows from Theorem 10.2.6. 

(5) sin(x) = “52 

(6) cos(x) = ete 

(7) (sin(x))? + (cos(x))? = 1. (Recall from Remark 2.4.8 that for trigonometric func- 
tions we also write this as sin?(a«) + cos?(x) = 1, but for an arbitrary function f, 
f(x) refers to f(f(ax)) rather than for (f(a))?.) 

(8) Dividing the equality in the previous part by (cos(z)) 

(tan(x))? + 1 = (sec(zx))?. 

(cot(x))? + 1 = (cse(x))?. 


2 yields the equality 


) 

) sin’(a) = cos(x). 
11) cos’(x) = —sin(x). 
) 

) 


For all x € R, sin(a) = — sin(—2) and cos(x) = cos(—2). 

sin and cos take on non-negative values on [0,7 /2]. By the previous part, cos takes 

on non-negative values on |[—7/2, 7/2]. 

(14) sin is increasing on [—7/2,7/2] since its derivative cos is non-negative there and 
positive on (—7/2, 7/2). 

(15) sin, when restricted to |—7/2,7/2], has an inverse by Theorem 2.9.4. The inverse 
is called arcsin. The domain of arcsin is [—1, 1]. 

(16) Geometrically, sin takes on non-negative values on [0,7] and positive values on 
(0,7), so that cos, when restricted to [0,7], has an inverse, called arccos. The 
domain of arccos is [—1, 1]. 

(17) Verify the details in the following. The derivative of tan is always non-negative, so 

that on (—7/2, 7/2), tan is invertible. Its inverse is called arctan, and the domain 


of arctan is (—o0, oo). 


Theorem 10.3.3. Refer to Theorem 6.2.7. 
(1) For x € (—1/2, 7/2), arcsin'(x) = =>. 
/ = =1 
(2) For x € (0,7), arccos’(x) = Ft 
(3) arctan’ (a7) = 


_1 
lta2° 


Proof. We prove the first and the third parts; the proof of the second part is similar to the 
proof of the first. 


1 
arcsin’ (x) = — : (by Theorem 6.2.7) 
sin (arcsin(z)) 


1 
cos(arcsin(2)) 
1 


\/(cos(arcsin(x)))? 
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(because cos is non-negative on |—7/2,7/2]) 


1 
7 \/1 — (sin(aresin(zx)))? 
1 
ers 
1 


arctan’ (x) = (by Theorem 6.2.7) 


tan’ (arctan(z)) 
1 
(sec(arctan(«)))? 
1 
1+ (tan(arctan(zx)))? 
1 
14+ 22° 
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10.3.1. Prove that there exists a real number x such that cosx = x. (Hint: Intermediate 


value theorem Theorem 5.3.1.) 


10.3.2. Prove that for all positive x, x > sin(x). (Hint: prove that f(x) = x — sin(z) is 


an increasing function.) 
10.3.3. Fix a constant c and let f : R — R be defined as 
_ sin(+), ite 0; 
fo) = {8 Ho 0: 
Prove that f is not continuous at 0 no matter what c is. 


10.3.4. Let f : R— R be defined as 


= [epsin( Ss) ahase0; 
fe= {5 if2=0. 


i) Prove that f is differentiable at all non-zero x. 


ii) Prove that f is continuous but not differentiable. (Hint: not differentiable at 0.) 


10.3.5. Let f : R— R be defined as 
sin(4), ifa 40; 


2 
He={5 " if2=0. 
i) Compute f’. 


ii) Prove that f’ is not continuous (at 0). 
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10.3.6. Let f : R— R be defined as 
_ Ja*sin(4), if x £0; 
fe) = {8 if =0. 
i) Compute f’. 
ii) Prove that f’ is continuous but not differentiable. 


10.3.7. Let f : R— R be defined as 


_ fa*sin(+), if e £0; 
fe)={F if « =0. 


JR 


i) Compute f’ and f”. 
ii) Is f” continuous or differentiable? Prove your claims. 
10.3.8. Let f : [0,7/2] — R be defined by f(x) = (a/2) sin(x) — a. 
i) Prove that there is exactly one c € [0,7 /2] such that f’(c) = 0. 
ii) Prove that (7/2) sin(x) > x for all x € [0,7/2]. (Compare with Exercise 10.3.2.) 
10.3.9. Prove that for all x € R, 
_ 1-cos(2z) _ 1+ cos(2z) 


(sin(x))” = ——>——,,_ (cos(z))” = 5 


(Hint: Exercise 10.2.3.) 
10.3.10. Let f be sin or cos. 
i) Compute the Taylor polynomial Tio,¢,9 of f of degree 10 centered at 0. 
) Compute Tio, ¢,0(0) and Tio, ¢,0(1). 
iii) Estimate the errors | f(0) — Tyo, f,0(0)| and |f(1) — T10,¢,0(1)| with Theorem 6.5.5. 
) 


iv) In fact, calculators and computers use Taylor polynomials to compute values of 


il 


trigonometric and also exponential functions. Compute f(1) and f(100) to within 
0.000001 of their true value. What degree Taylor polynomial suffices for each? 


10.3.11. Prove that tan~'(x) + tan7'(4+) = % for all x € R \ {0}. (Hint: take the 


x 
derivative of the expression on the right and use a clever 2.) 


10.3.12. Use integration by substitution (Exercise 7.4.5). 
i) Determine antiderivatives of tan and cot. 
ii) Determine antiderivatives of sec and csc. (Hint: Use first the rewriting trick 
seo() = ee ayitanta)) 
10.3.13. Compute {(sin(z))? dx (or [(cos(x))? dx) twice, with two different methods: 
i) Use Exercise 10.3.9. 


ii) Use integration by parts twice (Exercise 7.4.6) and rewrite after the second usage. 
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10.3.14. 
1) Compute limy—+.0 lar sin(x) dx. 
2) Compute limy_+.. Soon in(x) dz. 


3) Compute limy_+.. fia onal sin(x) dx. 


4) Conclude that sin is not integrable over (—oo, 00) in the sense of Exercise 7.4.10. 


PN eee 


10.3.15. Let k,j € Z. Prove the following equalities (possibly by multiple integration by 
parts (Exercise 7.4.6)): 


i) ["_sin(kt) cos(jt) dt = 0. 
ii) [“_sin(kt) sin(jt) dt = i: 


iii) ["_cos(kt) cos(jt) dt = ie 


10.3.16. Prove that for any non-zero integer k, 


ig kor 7k = 0; 
otherwise. 
ity Ak or jk, 
otherwise. 


1 
— sin(kx) + C, 


1 
[ esin tex) cS =o cos(ka) + 72 


and that 


1 1 
[ vcostke) d= aa sin(ka) + RB cos(ka) + C. 


10.3.17. Lette RandneN*. 
i) Prove that sin((n + $)t) =sin((n — $)t) + 2cos(nt) sin($t). (Hint: Exercise 10.2.3, 
multiple times.) 
ii) Prove that sin((n + 4)t) = sin($t)(1 + 25°;_, cos(kt)). 


iii) Give a formula or formulas for simplifying )7/_, cos(kt). 


10.3.18. (Fourier analysis, I) Let f : [-a,a] — C be a differentiable function (or pos- 
sibly i quite differentiable, but that would take us - far). For n € No define a, = 
{2 f(x) cos(nz) dx, and for n € N* define 6, = f"_ f(x) sin(nx) dz. The Fourier series 
of f is 


> + 2 Ay, cos(nx) + by, sin(nz)). 


i) Compute the Fourier series for f(x) = sin(3x). (Hint: Exercise 10.3.15.) 


ii) Compute the Fourier series for f(z) = (sin(z))?. (Hint: Exercises 10.3.9 and 
10.3.15.) 


iii) Compute ao, a1, G2, b1, b2 for f(x) = x. (Hint: Exercise 10.3.16.) 
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10.3.19. (Fourier analysis, II) Let f : [-17,a] — C be a differentiable function (or 
possibly not quite differentiable, but that would take us too far). For n € Z define 
an = f"_ f(x)e” dx. The Fourier series of f is 


CO 
y a,e ”. 


n=—oCo 
Compute the Fourier series in this sense for the functions in the previous exercise. 
10.3.20. Use Exercise 10.2.11 to elaborate on the two types of Fourier series from the pre- 


vious two exercises for any polynomial function. Note that Fourier series are also functions 
in the form of infinite sums but they are not power series. 


10.4 Examples of L’H6pital’s rule 


Several versions of L’Hopital’s rule were proved in Section 6.4. With an increased 
repertoire of functions we can now show more interesting examples, including counterex- 


amples if some hypothesis is omitted. All work in this section is in the exercises. 


Exercises for Section 10.4 


10.4.1. The proof of Theorem 10.2.5 required knowing that lim sin(s) = 1. Re-prove this 
Hea 


with L’H6pital’s rule. Discuss why this is not an independent confirmation of the fact. 


10.4.2. Prove that iim Jcos(#) = 0 first by L’Hopital’s rule. Then justify all steps in the 
following proof: 
1 — cos(x) _ l-—cos(x) 1+ cos(z) 
z30 206 fté~<Cs«a 8 x "1 +cos(z) 
sies 1 — (cos(x))? 
«0 “(1 + cos(x)) 
(sin(x))? 


aes ) 
=| 


40 x(1 + cos(z)) 


leis sin(z) _sin(z) 
z30 2 1+ cos(x) 
0 
a eae 
de et 
= 0. 


The next two exercises are taken from R. J. Bumcrot’s article “Some subtleties in 
L’Hopital’s rule” in Century of Calculus. Part IT. 1969-1991, edited by T. M. Apostol, 
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D. H. Mugler, D. R. Scott, A. Sterrett, Jr., and A. E. Watkins. Raymond W. Brink 


Selected Mathematical Papers. Mathematical Association of America, Washington, DC, 
1992, pages 203-204. 


10.4.3. Let f(x) = x? sin(x—*) and g(z) = sing. 


i) Prove that limz-4o ae = 


ii) Prove that lim, _.9 eae does not exist. 
iii) Why does this not contradict L’H6pital’s rule Theorem 6.4.2? 
10.4.4. Let f(x) = asin(x~') and g(x) = sinz. 
i) Compute lim,_.9 f(x) and lim,_,9 g(a). 
ii) Compute f’ and g’. 


iii) Prove that neither lim, ,9 oe nor lim,_.o ae exist. 
*10.4.5. (This is from R. P. Boas’s article “Counterexamples to L’H6pital’s rule”, Amer- 


ican Mathematical Monthly 93 (1986), 644-645.) Define f(z) = {5 cos*(t)dt, g(x) = 
f(2) 


2+sin(x)° 
i) Prove that lim f(x) = oo = lim g(z). 
ii) Prove that lim AC) 0h 


iii) Compute f’(x), g/(z). 
iv) Write f’(x) = fi(x 


that lim file) 
xL—->CO G1 (x) 


cos and g'(x) = gi(x)-cosz for some functions f,,g1. Prove 


) : 

= 0. 
/ 

PO) 46, 

g' (x) 

vi) Why does this not contradict L’H6pital’s rule at infinity (proved in Exercise 6.4.4)? 


v) Prove that lim 
LOO 


*10.4.6. (This is from R. C. Buck’s “Advanced Calculus”, McGraw-Hill Book Company, 


Inc., 1956, page 48.) Let f(z) = 27 +sin(2x), g(x) = (2x + sin(2zx))e8™), and h(x) = 
— sin(x) 
€ . 


i) Prove that lim,-,.. h(x) does not exist. 


ii) Prove that limz +o. oe) does not exist. 

iii) Verify that f’(x) = 4cos?(x) and that g(a) = (4cos(x)+2x¢+sin(2x)) cos(x)e™®), 
A : 4 cos(x 

iv) Prove that lim... el Soul) EEa Te (ayy =(); 

v) Why does L’Ho6pital’s rule not apply here? 


*10.4.7. (Modification of the previous exercise.) Let f(x) = 2 + sin(2), and g(x) = 
(2+ sin(2))esn(=), Prove that lim,_,9+ oe does not exist but that lim, _,9+ aie =20, 
Why does L’Hopital’s rule not apply here? 
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10.5 Trigonometry for the computation of some series 


The goal of this section is to handle a few more infinite series and power series. 
Namely, so far we have been able to add up few infinite series numerically, and here we 
compute for example )°7~_, Zz in two different ways. (In Example 9.1.9 we proved that 
the sum converges but we did not know what it converges to.) All work is in the exercises. 
As the title suggests, these uses are not covered in a standard first analysis course and it 
is fine to omit this section. 


Exercises for Section 10.5 


10.5.1. (This is taken from the Monthly Gem article by D. Velleman in the American 
Mathematical Monthly 123 (2016), page 77. The gem is a simplification of the article 
by Y. Matsuoka, An elementary proof of the formula }\7°,k~? = 7?/6, in American 
Mathematical Monthly 68 (1961), 485-487.) For every non-negative integer n define [,, = 
Io? (cos(x))?” da and i pe x7 (cos(#))?" da. 
i) Prove that Ip = 1/2 and Jo = 13/24. 
ii) Prove that for all n > 1, I, = (2n —1)U,_-1 — J,). (Hint: integration by parts 
(Exercise 7.4.6) and dv = cos(x) dz.) 
iii) Prove that I, = an—t -In-1- 
iv) Use integration by parts twice, first with dv = dx and then with dv = 2xdz, to 
prove that for alln > 1, I, =n(2n —1)Jy_1 — 2n7In. 
v) Prove that for alln >1, 4 =2 (4 — 7). 
vi) Prove that for all integers N, 
Ss 1 =2(2 mw) = o JN 
a ke Io In 6 In - 


vii) Use the inequality x < (7/2) sin(x) for x € [0,7/2] (see Exercise 10.3.8) to estimate 


2 m/2 
0< Jn < =f (sin(ax))?(cos(a))?% dx 
0 
2 
T 
= 7 Un — In41) 
n i 
oe = 
4 2N+2™ 
viii) Prove that > rae > 


k=1 
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*10.5.2. (This exercise is used in Exercise 10.5.3.) Use de Moivre’s formula (see Exer- 


cise 10.2.4) to prove that for all x € R and all positive integers n, 


cos(nx) + isin(nx) = (sin(a))”(cot(a) +7)” 


n 


= (sin(x))” 5° (j,) et(cot(ayyr- 


k=0 
If n = 2m +1, this says that 
cos((2m + 1)a) + isin((2m + 1)x) 
2m+1 
2 1 
Siena) ry ( a J ®oor(e)yem* 


k=0 


The imaginary part of this is: 


2m+1 


sin((2m + 1)x) = (sin(«))?"*" Ser | 


) (cot(a))2"—2*, 


Let P(x) = yay eae Prove that P is a polynomial of degree m with 
leading coefficient 2m +1. Prove that for k =1,...,m, (cot(ka/(2m+1)))? is a root of P. 


Prove that P has m distinct roots. Prove that 


P(a) = (2m +1)- Hi (-- (cot (5) ] 


By equating the coefficient of z™~1 in the two forms of P prove that 5°; (cot(km/(2m + 
eee 


*10.5.3. (See also Exercises 10.5.1 and 10.5.5.) (This is from the article I. Papadimitriou, 
A simple proof of the formula S77... k7~? = 2/6, American Mathematical Monthly 80 
(1973), 424-425.) Let « be an angle measured in radians strictly between 0 and 7/2. Draw 
the circular wedge with angle x on a circle of radius 1. The largest right triangle in this 
wedge whose hypotenuse is one of the wedge sides of length 1 has area 4 sin(x) cos(x), and 
the smallest right triangle containing this wedge whose side is one of the wedge sides (of 
length 1) has area $ tan(z). 

i) Prove that sin(x) cos(x) < « < tan(x), and that (cot(x))? < 4. 

ii) Use Exercise 10.3.2 to prove that & < 1+ (cot(z))?. 

iii) Prove that for all integers k,m with 1 <k <™m, 


s ee, ke *  @m +1) =o 4 
2n+1 nT? k2 


k=1 
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2 
iv) Use Exercise 10.5.2 to prove that mGm=1) < Gm) re < mt meme 
lee) 2 F : ‘ 
v) Prove that )77, a = *,. (Hint: multiply the previous part by n? /4m?.) 


10.5.4. Fill in the explanations and any missing steps in the two double improper inte- 
egrals. (While integrating with respect to x, think of y as a constant, and while integrating 
with respect to y, think of x as a constant.) 


. dx | dy = —— ——— dx | d 
/ Tay lapay ) : : ee 1+ ay 
nny ne Yh 
9 lty\Now Jo 1tax*y 
ae 1 Pe arctan(,/yx) % ny 


nr ff 1 
a, ieay 


=i f° Qu du 
— 2 Jo u(1 + u?) 


(by substitution y = u7) 
2 


9° 
dy \d 
[ (/ Leh eb eey v) : 
cy Gey cae 1 x? 
— dy \ d 
i. (/ ae Ce ar) v) i 
oO 4 i 
= ia (32) & 


x 1 
1 i 
Inz Inu 
= d. d 
(| z?—1 o+ f u2—1 u) 
Inz 


=4 Rerar) dx. (Stop here.) 
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*10.5.5. (This is from the article D. Ritelli, Another proof of ¢(2) = _ American Mathe- 
matical Monthly 120 (2013), 642-645.) We proved in Section 9.4 that derivatives and (def- 
inite) integrals commute with infinite sums for power series. There are other cases where 
integrals commute with infinite sums, but the proofs in ercalct generality are harder. Ac- 
cept that 1 3dr = = fol =I) S590 de =n ele (—Inx)x?*dx. Also accept that 
the two integrals in Exercise 10.5.4 are the same (order of integration matters sometimes). 


2 


Use Exercises 7.6.7 and 9.2.1 to prove that )°7° Zz = 5: 


Appendices 


Appendix A. Advice on writing mathematics 


Make your arguments succinct and straightforward to read 


Write preliminary arguments for yourself on scratch paper: your first attempt may 
yield some dead ends which definitely should not be on the final write-up. In the final 
write-up, write succinctly and clearly; write what you mean and mean what you write; 
write with the goal of not being misunderstood. Use good English grammar, punctuate 
properly. And above all, use correct logical reasoning. 

Do not allow yourself to turn in work that is half-thought out or that is produced 
in a hurry. Use your best writing, in correct logical order, with good spatial organization 
on paper, with only occasional crossing out of words or sections, on neat paper. Represent 
your reasoning and yourself well. Take pride in your good work. 


Process is important 


Perhaps the final answer to the question is 42. It is not sufficient to simply write “42”, 
“The answer is 42,” or similar, without the process that led to that answer. While it is 
extremely beneficial to have the intuition, the smarts, the mental calculating and reason- 
ing capacity, the inspiration, or what-not, to conclude “42”, a huge part of learning and 
understanding is to be able to explain clearly the reasoning that lead to your answer. 

I encourage you to discuss the homework with others before, during or after com- 
pleting it: the explanations back-and-forth make you a better thinker and expositor. 

Write your solutions in your own words on your own, and for full disclosure write 
the names of all of your collaborators on the work that you turn in for credit. I do not 
take points off, but you should practice full honesty. 

Sometimes you may want to consult a book or the internet. Again, on the work that 
you turn in disclose the help that you got from outside sources. 

Keep in mind that the more you have to consult outside sources, the more fragile 
your stand-alone knowledge is, the less well you understand the material, and the less likely 
you are to be able to do satisfactory work on closed-book or limited-time projects. 


Do not divide by 0 


Never write “1/0”, “0/0”, “07/0”, “oo/0”. (Erase from your mind that you ever saw 
this in print! It cannot exist.) 
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Sometimes division by zero creeps in in subtler ways. For example, to find solutions 
to x? = 3a, it is wrong to simply cancel the x on both sides to get only one solution x = 3. 
Yes, x = 3 is one of the solutions, but « = 0 is another one. Cancellation of x in x? = 3x 
amounts to dividing by 0 in case the solution is x = 0. 


Never plug numbers into a function that are not in the domain of the function 


By design, the only numbers you can plug into a function are those that are in the 
domain of the function. What else is there to say? 

I will say more, by way of examples. Never plug 0 into the function f(x) = 4 (see 
previous admonition). Even never plug 0 into the function f(2) = =: the latter function 
is undefined at x = 0 and is constant 1 at all other «x. 


Never plug —1 into Vor into In. 


Do not plug x = 0 or x = 1.12 into f that is defined by f(x) = ai a 


9x +1, ifa<—l. 


Order of writing is important 


The meanings of “Everybody is loved by somebody” and “Somebody loves every- 
body” are very different. Another way of phrasing these two statements is as follows: “For 
every x there exists y such that y loves x,” and “There exists y such that for every x, y 
loves x.” In crisp symbols, without distracting “such that”, “for”, and commas, these are 


written as “Vx dy, y loves x” and “Ady Vax, y loves x.” 

Conclusion: order of quantifiers matters. 

The order also matters in implications; simply consider the truth values of “If x > 2 
then x > 0” and “If x > 0 then x > 2.” 

The statement “if A then B” can be written also as “B if A”, but in general it is 
better to avoid the latter usage. In particular, when writing a long proof, do not write out 
a very long B and only at the very end add that you were assuming A throughout; you 
could have lost the doubting-your-statements reader before the end came in sight. 

Here is a fairly short example where “B if A” form is not as elegant. With the 
statement “2 is ... if ¢ 4 0” you might have gone over the abyss of dividing by 0 and no 
ifs can make you whole again. It is thus better to first obtain proper assurances, and write 
“if ¢ 0 then 2 Issac 


Write parentheses 
-— is not a recognized binary operator. Do not write “5-—2”; instead write “5-(—2)”. 
lim 4-32 =4—- 3a, whereas lim (4-32) =7. 
xz——1 xz->—1 


“[ 4 — 3a dx” is terrible grammar; instead write “(4 — 3x) dx”. 
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m does not equal 3.14159 


If the answer to a problem is 7/17 /59, leave it at that. This is an exact number from 
which one can get an approximation to arbitrary precision, but from 0.21954437870195 one 
cannot recover further digits. Never write “r/17 /59 = .21954437870195”, but it is fine to 
write “1/17/59 & .21954437870195”. Usually it is not necessary to write out the numerical 
approximation, but sometimes the approximation helps us get a sense of the size of the 
answer and to check our derivation with any intuition about the problem. The answer to 


how far a person can run in one minute certainly should not exceed a kilometer or a mile. 


To prove an (in)equality, manipulate one side in steps to get to the other 


If you have to prove that S>;_, k? = aint Gard) for n = 1, do NOT do the 
following: 


se (tayo s) . a a 
kal 6 a ee OC 
(ees: we Ft Z 
{72= a es] Oo 
6 es B 4 

= ry 

esl "4 


The reasoning above is wrong-headed because in the first line you are asserting the 
equality that you are expected to prove, and in subsequent lines you are simply repeating 
your assumptions more succinctly. If you add question marks over the three equal sums 
and a check mark on the last line, then you are at least acknowledging that you are not 
yet sure of the equality. However, even writing with question marks over equal signs is 
inelegant and long-winded. That kind of writing is what we do on scratch paper to get our 
bearings on how to tackle the problem. But a cleaned-up version of the proof would be 
better as follows: 


=l|]= — . 
6 


ee Pepe 3 ys) 
7 6 
k=1 


Do you see how this is shorter and proves succinctly the desired equality by transitivity of 
equality, with each step on the way sure-footed? 

Another reason why the three-line reasoning above is bad is because it can lead to 
the following nonsense: 


9 
i ex) 
? 
add 1 to both sides: 24 
? 
multiply both sides by 0: = 0 o 
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But we certainly cannot conclude that the first line “1 = 0” is correct. 


Do not assume what is to be proved 


If you have to prove that a is positive, do not assume that a is positive. (For a 
proof by contradiction suppose that a < 0, do correct logical reasoning until you get a 
contradiction, from which you can conclude that a < 0 is impossible, so that a must have 
been positive. See page 21 for proofs by contradiction.) 


Limit versus function value 


Asking for lim,_,, f(x) is in general different from asking for f(a). (If the latter is 
always meant, would there be a point to developing the theory of limits?) 


Do not start a sentence with a mathematical symbol 


This admonition is in a sense an arbitrary stylistic point, but it helps avoid certain 
confusions, such as in “Let « > 0. x can be taken to be negative.” Here one could read 
or confuse the part “0. x” as “O- x2 = 0”, but then € would have to be both positive and 
negative. Do not force a reader to have to do a double-take: write unambiguous and correct 


sentences. 


Appendix B. What one should never forget 


Logic 


We should remember the basic truth tables, correct usage of “or” and of implications, 
how to justify/prove a statement, and how to negate a statement. 


If A implies B and if A is true, we may conclude B. 
If A implies B and if B is false, we may conclude that A is false. 
If A implies B and if A is false, we may not conclude anything. 


If A implies B and if B is true, we may not conclude anything. 


Truth table: 
PQ | not P Pond OP or@: 2 xo 2s 0) uP 
T T F T fi F fly oe 
T F F F oP ai F F 
F T T F fg T T F 
F F a F F F T a 
Negation chart: 
Statement Negation 
not P P 
P and Q (not P) or (not Q) 
PorQ (not P) and (not Q) 
P=>Q P and (not Q) 
PsQ P< (not @)}= (nouP) SQ 
For all x of the specified type, There exists x of the specified 
property P holds for x. type such that P is false for x. 
There exists x« of the specified type | For all x of the specified type, 
such that property P holds for 2. P is false for x. 


[Statement |] How to prove it 
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P (via contradiction). 


Suppose not P. Establish some nonsense 
(that makes not P thus impossible so that 
P must hold). 


PandQ. 


Prove P. Prove Q. 


PorQ. 


Suppose that P is false. Then prove Q. 
Alternatively: Suppose that Q is false and 
then prove P. 

(It may even be the case that P is true 
always. Then simply prove P. Or simply 


prove Q.) 


If P then Q. 


Suppose that P is true. Then prove Q. 
Contrapositively: Suppose that Q is false. 
Prove that P is false. 


Pe), 


Prove P > Q. Prove Q => P. 


For all x of a specified 
type, property P holds for 
iL: 


Let x be arbitrary of the specified type. 
Prove that property P holds for x. 
(Possibly break up into a few subcases.) 


There exists x of a speci- 
fied type such that prop- 
erty P holds for «x. 


Find/construct an x of the specified type. 
Prove that property P holds for x. 
Alternatively, invoke a theorem guarantee- 
ing that such x exists. 


An element x of a speci- 
fied type with property P 


is unique. 


Suppose that x and 2’ are both of spec- 
ified type and satisfy property P. Prove 
thaha <2 

Alternatively, show that x is the only so- 
lution to an equation, or the only element 


on a list, or .... 


x with property P is 


unique. 


Suppose that x and y have property P. 
Prove that x = y. 


Appendix B: What one should never forget 377 


Mathematical induction 


The goal is to prove a property for all integers n > no. First prove the base case, 
namely that the property holds for no. For the inductive step, assume that for some 
n—1> no, the property holds for n — 1 (alternatively, for no,no + 1,...,n—1), and then 
prove the property for n. 


The limit definition of derivative 


f(a) = tim LOFM=LO© — jpg = LO) 
h—0 h ma £-a 
The limit-partition definition of integrals 
A partition of [a,b] is a finite set P = {9,21,...,2%,} such that m9 = a < x < 


LQ<+++ << Bpn-1 < Ln =. Let f : [a,b] — R be a bounded function. For each j = 1,...,n, 
let 


fy = Int f(e)c@ € [e543 05), My = supl f(a): 2 -e [a4 25|h 


The lower sum of f with respect to P is 


P)= dems (a3 Spat): 


The upper sum of f with respect to P is 


Me jo Uj 1)- 


The lower integral of f over [a,b] is 
L(f) =sup{L(f, P) : as P varies over partitions of [a, b]}, 
and the upper integral of f over |a, b] is 
U(f) =inf{U(f, P): P varies over partitions of [a, b]}. 


We say that f is integrable over [a,b] when L(f) = U(f). We call this common value 
the integral of f over [a,b], and we write it as 


fr freee fro 


378 Appendix B: What one should never forget 


The Fundamental theorems of calculus 


I: Let f,g : [a,b] > R such that f is continuous and g is differentiable with g' = f. Then 


b 
: f =9(0) - g(a). 


II: Let f : [a,b] > R be continuous. Then for all x € |a, 6], f is integrable over [a, x], and 
the function g : [a,b] + R given by g(x) = J” f is differentiable on (a,x) with 


=f t=#. 


Geometric series 
isan r® diverges if |r| > 1 and converges to x if |r| <1. 
> pot diverges if |r| > 1 and converges to ;+- if |r| <1. 
Forallre C\ {1h yy 6 rk — 


petty 


r—1 


Never divide by 0 


It bears repeating. Similarly do not plug 0 or negative numbers into In, do not plug 
negative numbers into the square root function, do not ascribe a function (or a person) a 
task that makes no sense. 


Index 


SYMBOLS 

o abstract binary operation 77 
C 50, 139 

ca n choose k 46 

A® complement of A 53 

\ complement of sets 53 

°o composition of functions 69 
cos 23, 358 

e (identity element for binary operation) 77 
e: Euler’s constant 269 

{} empty set 50 

—) empty set 50 


4 there exists 25 

V for all 25 

f” function composed with itself 69 

sin” power of trigonometric function (not 
composition) 69 

> (strictly) greater than 87 

> greater than or equal to 87 

e (identity element for binary operation) 77 

id (identity function) 67 

< if and only if 17 

= implies 16 

€ in a set 49 

inf: infimum 88 

M intersection 52 

< (strictly) smaller than 87, 89, 91, 110 

> (strictly) greater than 110 

> greater than or equal to 110 

< less than or equal to 87, 110 

lim of sequence 283 

lim inf 314 

lim sup 314 

In: natural logarithm 266 

- logical not 14 

V logical or 15 


/ logical and 15 

L(f,P): lower sum 241 
N,N*,No 50 

No 104 

+ 99 

ths I] product 35, 79 
Q 50, 121 

// end of proof 18 

QED end of proof 18 

end of proof 18 

B end of proof 18 

R 50, 126 

R>o 50 

Range: range of a function 66 
*/ 138, 206, 207 

sin 23, 24, 358 

C subset of a set 52 

¢ proper subset 52 

Z not a subset 52 


paar ee ae, 


k=1 
sup: supremum 88 


U union 52 

U(f, P): upper sum 241 
Z 50, 114 

0° 34 


A 

Abel’s lemma 343 

absolute value 19, 98 
absolutely convergent 326 
advice on writing proofs 371 
alternating test for series 325 
and (logical) 15 

antecedent 16 
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antiderivative 259 

arcsin, arccos, arctan 359 
area (signed or usual) 239 
associative 78, 83, 84 
axiom 103 


B 

base case (of induction) 36, 40 

base of exponentiation 86, 268 

Benjamin, A. 44 

Bernoulli’s inequality 96 

bijective function 69 
composition 70 

binary operation 77 
associative 78 
commutative 81 

binomial coefficient 46 

Boas, R. P. 364 

Bombelli, R. 147 

bounded function 240 

bounded set 88 

Buck, R. C. 364 

Bumcrot, R. J. 363 


C 
C 50, 139 
absolute value 144 
arithmetic 139 
Cartesian coordinates 148 
distance 144 
length, norm, absolute value 144 
polar coordinates 148 
properties 140 
reverse triangle inequality 145 
triangle inequality 145 
Campbell, P. J. 357 
cancellation 81, 113, 120, 123 
Cartesian coordinates 148 


Cartesian product 59 
Cauchy’s criterion for series 323 
Cauchy’s mean value theorem 230 
ceiling function 72 
chain rule 222 
chart 
how to prove 29, 376 
negation 32, 375 
choose: n choose k, (2) 46 


closed and bounded—contained in open 


sets 159, 160 

closed set 156 
closed-form antiderivative 259 
codomain (of a function) 66 
commutative 81, 83, 84 
compact set 159 
comparison 

sequences with infinite limits 292 

sequences 298, 303 

series 324 


completeness of R, C 308 
complex conjugate 143 
complex number 139 
with e 356 
composition of functions 70 
conditional statement 15 
antecedent, consequent 16 
beware 17 
congruent modulo n 63 
conjunction 15 
connuousti 197 
consequent 16 
construction of No 104 
construction of Q from Z 66, 120 
construction of R 126 


construction of Z from No 64 
containment 49 


continuous 193 


and inverse function 203 

and monotonicity 203 

connuousti 197 

exponential function 209, 210 

exponentiation 207 

image of interval 203 

power series 344 

properties 194 

suounitnoc 197 

ticonnuous 197 

uniformly continuous 210, 231 
convexity 229 


cos: trigonometric function 23, 354, 355, 


358 
COS 352 
cover, subcover 159 


D 
Darboux’s theorem 227 
de Moivre’s formula 357, 366 
Dedekind cut 125 
standard 133 
definition: beware 17 
degree (of polynomial) 34 
derivative 214 
and monotonicity 229 
complex-valued functions 223 
inverse 223 
Leibniz’s notation 214 
Newton’s notation 214 
power rule for rational exponents 225 
properties 219, 221, 222, 223, 225 
differentiable 214 
hence continuous 219 
Dirichlet’s test 328 
disjunction 15 
distributivity 83 
division 84 
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do not divide by zero 85 
domain (of a function) 66 
implicit 68 


E 
evs. 7 271 
Eantegral 249 
element 49 
empty product 36, 80 
empty set 50 
empty sum 35 
enumeration of Qt 280 
epsilon-delta definition 161, 170 
epsilon-N definition 191 
equivalence 

class 62, 114 

logical 17 

relation 61, 114 
Euclidean algorithm 39 
Euclid 33 
Euler’s constant (e) 269, 350 
even function, its integral 73 
even integer 18, 22 
exclusive or (xor) 15 


existential quantifier 4 25 
exponential function 268 
continuity 210 
without derivatives and integrals 209 
exponentiation 86, 207 
by real exponents 268 
continuity 207 
exponent 86, 268 
Extreme value theorem 199 


F 

Ferguson, S. J. 200 

Fibonacci numbers 42, 67, 279 
field 83, 122, 131, 140 
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axioms 83 
unusual 87 
fixed point theorem 204 
floor function 72 
Fourier series 362, 363 
Fueter, R. 280 
function 66 
bijective 69 
ceiling 72 
codomain 66 
composition 69 
connuousti 197 
constant 67 
continuous 193 
decreasing 96, 97 
domain 66 
exponential 209, 210, 268 
floor 72 
graph 66 
identity 67 
increasing 96, 97 
injective 69 
monotone 96 
inverse 97 
one-to-one 69 
onto 69 
polynomial 34, 71 
power 97 
product of 86 
radical 97, 206 
range 66 
rational 71 
suounitnoc 197 
surjective 69 
ticonnuous 197 
Fundamental theorem of arithmetic 31 
Fundamental theorem of calculus 258, 260, 
261 


complex-valued 265, 266 


G 

Gaitanas, K. 42 

generalized power function 268 
generalized power series 334 
geometric series 321 

Goldbach’s conjecture 13, 76 
Greatest lower bound theorem 135 
greatest lower bound: see infimum 


H 

halving intervals 158 
harmonic sequence 309 
harmonic series 321 

higher order derivatives 234 
Hoseana, J. 186 
how-to-prove chart 29, 376 


I 

identity element 77 
additive 83, 84 
multiplicative 83, 84 

identity function 67 

if and only if, iff 15 

imaginary axis 139 


imaginary part of a complex number 141 


implication 15 
improper integral 264, 265 
indefinite integral 259 
index (of unions, intersections) 53 
induction 36, 104 

base case 36, 40 

inductive step 36, 40 
inductive set 103 
infimum, inf, glb 88 

—oo 314 
infinite sequence: see sequence 


infinite series: see series 
injective function 69 
composition 70 
integer 93, 114 
integrable 243, 253 
integral 243 
addition 255 
applications 272 
complex-valued function 265 
double 367 
monotonicity 256 
notation 243, 246, 258, 261 
test for series 326 
integration by parts 263 
integration by substitution 262 
Intermediate value theorem 202, 227 
interval 49, 90 
inverse 78, 97 
additive 78, 84 
derivative 223 
multiplicative 78, 84 
of a function 75 
of a strictly monotone function 97 
of product 80 
unique 79 
invertible (element) 79 
invertible (function) 75 


J 
joke 28 


L 


L’Hopital’s rule 231, 232, 270, 271, 363 


at infinity 234, 270, 364 
Lagrange interpolation 75 
least element 90, 95, 111 
Least upper bound theorem 135 
least upper bound: see supremum 


Index 


Leibniz, G. 147 
lemma: helpful theorem 343 
length of curve 272 
limit inferior, liminf 314 
limit point (of set) 155 
limit superior, limsup 314 
limit 
absolute value 183 
at infinity 191 
composite function theorem 184 
epsilon-delta definition 161, 170 
epsilon-N definition 191 
infinite 187 
M-delta definition 187 
M-N definition 191 
of a function 161, 170, 187, 191 
of sequence 283 
polynomial rule 182 
power rule 182 
rational rule 183 
real/imaginary part 183 
right /left-handed 170, 187 
squeeze theorem 184 
subsequence 312 
theorems for infinite limits 189 
theorems 180, 182, 183, 184 
uniqueness 179 
In, natural logarithm 266 
inverse 350 
logarithmic differentiation 270 
logic circuits 24 
logical biconditional 17 
logic 13, 25, 28 
lower bound of set 88 
lower integral 243 
lower sum (integrals) 241 


383 


384 Index 


M open set 153 
Matsuoka, Y. 365 or (logical) 15 
Mean value theorem for integrals 263 ordered pair 58 
Mean value theorem 228 ordered set 91 
member (of a set) 49 field 91 
MeStrovié, R. 33 order 87 
modulo n 63 No 110 
modus ponens 18 Q 123 
monotone function 96 R 133 
and derivative 229 Z 118 
monotone sequence 304, 307 
P 
N p-test for series 327 
N, N* 50 Papadimitriou, I. 366 
No 50, 93, 104 parallelogram rule for addition 139 
natural logarithm, In 266 partial sum (of a sequence) 319, 320 
natural number 104 partition (of an interval) 240 
negation 14, 32 refinement 242 
chart 32, 375 partition 
negative 90, 119, 124, 134 sub-interval 240 
Nelson, E. 197 Pascal’s triangle 45 
non-negative 90, 119, 124, 134 perfect number 51 
non-positive 90, 119, 124, 134 mT vs. e 271 
not: see negation pigeonhole principle 74 
notation polar coordinates 148 
integral 258, 261 polynomial function 34, 71, 72, 142, 182, 
N,NtT,No 50 235, 297, 302 
set vs. sequence 278 degree 34 
Ci GxbZ Taylor polynomial 235 
number of elements in Q* 76 popcorn function 197 
positive 90, 119, 124, 134 
O power function 96, 97, 98, 182, 205, 297, 
odd function, its integral 73 302 
odd integer 18, 22 generalized 207, 268 
one-to-one 69 power notation 34, 69, 80 
onto 69 0° 34 
open ball 153 for functions 69, 83 


open cover 159 on sets with binary operation 80 


special in trigonometry 69 
power rule for derivatives 221 
generalized 225 
power series 329, 334 
continuous 344 
derivative 337 
generalized 334 
inverse 343 
numerical evaluation 338 
product of two 340 
radius of convergence 331 
ratio test 332 
root test 331 
Taylor series 344 
uniqueness 337 
prime 13, 18, 28, 31, 33 
definition 17 
principle of mathematical induction 36 
product [];_, 35, 79 
product (not composition) of functions 86 
product rule for derivatives 219 
product rule for higher derivatives 238 
proof 18, 28 
by contradiction 21 
by induction 36 
deductive reasoning 18 
how-to-prove chart 29, 376 
pigeonhole principle 74 
/2 is not rational 21 
proper subset 52 
Pythagoras 21 
Polya, G. 280 


Q 

Q 50, 121 
Archimedean property 124 
cancellation 123 
field 122 
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ordered 123 
QED (quod erat demonstrandum) 18 
quadratic formula 23, 71 
quantifier 
existential 25 
universal 25 
quartering rectangles 158 
quotient rule for derivatives 219, 238 


R 
R 50, 126 
Archimedean property 134 
field 131 
ordered 134 
properties 131, 134 
Raabe’s test 329 
radical function 97, 206 
radicals exist 137, 206 
radius of convergence (of power series) 331 


range (of a function) 66 
ratio test 
for power series 332 
for sequences 305, 315 
for series 324 
rational function 142 
rational number (in Q) 121 
rational numbers between real numbers 
135, 136 
rational number 21, 93 
real axis 139 
real number 126 
real part of a complex number 141 
recursive 279 
refinement (of a partition) 242 
relation on sets 60 
equivalence 61 
reflexive 61 
symmetric 61 
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transitive 61 
restriction of a function 69, 194 
reverse triangle inequality 100, 134, 145 
Ritelli, D. 368 
Rolle’s theorem 228 
root of a function 71 
root of a number 138, 206, 207 
root test for power series 331 
root test for series 325 


NS) 
Sagher, Y. 76 
second derivative test 237 
sequence vs. set notation 278 
sequence 278 
arithmetic on sequences 281 
bounded (below/above) 304 
Cauchy 307 
comparison for infinite limits 292 
comparison 298, 303 


completeness of R, C 308 
composite rule 297, 302 
constant 278 
convergence 283, 300 
properties 295, 297, 301, 302 
convergent 283 
divergence to +oo 292 
divergence 291 
Fibonacci 279 
finite 282 
limit 283 
unique 295, 301 
monotone 304, 307 
of partial sums 319, 320 
ratio test 305, 315 
squeeze theorem 299, 303 
subsequence 311 
term 278 


series 
absolutely convergent 326 
alternating test 325 
arithmetic with 320 
comparison 324 
converges 320 
diverges 320 
geometric 321 
harmonic 321 
integral test 326 
numerical evaluation 338 
p-series test 327 
power 329, 334 
ratio test 324, 332 
root test 325, 331 
set vs. sequence notation 278 
set 49 
complement 53 
disjoint 52 
empty set 50 
equality 52 
intersection 52 
member 49 
notation 49 
propositionally 51 
subset 52 
sum of 92 
union 52 
universal 53 
signed area 239, 241 
sin: trigonometric function 23, 24, 354, 
355, 358 
SIN 352 
socks-and-shoes 82 
spiral of Theodorus 44 
statement 13 
conditional statement 15 
conjunction 15 


disjunction 15 
equivalence 17 
if and only if 15 
implication 15 
logical biconditional 17 
negation 14 
xor 15 
sub-interval of a partition 240 
subsequence 311 
subsequential limit 312 
subset 52 
subtraction 84 
successor 103 
sum of sets 92 
yo fk? 365, 367 
summation )>;_, 34, 79 
summation by parts 328 
suounitnoc 197 
supremum, sup, lub 88 
co 314 
surface area of a surface of revolution 275 
surjective function 69 
composition 70 


T 
tautology 18 
Taylor polynomial 235, 271, 361 
Taylor series 344 

not approximating 348 
Taylor’s remainder theorem 236, 237, 263 
Thomae function 197 
ticonnuous 197 
topology 153, 155, 156 

closed set 156 

general 157 

limit point 155 

open ball 153 

open set 153 
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tower of Hanoi 44 
triangle inequality 19, 100 
over C 145 
reverse 100, 134, 145 
trichotomy (of <) 89, 111, 118 
trigonometry 354, 358 
tromino 44 
truth table 14, 375 


U 

uniform continuity 210, 231 
universal quantifier V 25 
universal set 53 

unusual field 87 

upper bound of set 88 
upper integral 243 

upper sum (integrals) 241 


Vv 

vacuously true 26 

Velleman, D. 44, 365 

Venn diagram 55 

volume of a surface of revolution 273, 274 
Vorob’ev, N. N. 42 


W 

well-defined 76 
well-ordered 90, 95, 111 
whole number 114 


xor 15 


Z 50, 114 
properties 116, 118 
zero of a function 71 
zero: do not divide by it 9, 85, 371 
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zeroth derivative 235 
zeroth power 34 


